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We  study  how  dynamical  decoupling  (DD)  pulse  sequences  can  improve  the  reliability  of  quantum 
computers.  We  prove  upper  bounds  on  the  accuracy  of  DD-protected  quantum  gates  and  derive 
sufficient  conditions  for  DD-protected  gates  to  outperform  unprotected  gates.  Under  suitable  con¬ 
ditions,  fault-tolerant  quantum  circuits  constructed  from  DD-protected  gates  can  tolerate  stronger 
noise,  and  have  a  lower  overhead  cost,  than  fault-tolerant  circuits  constructed  from  unprotected 
gates.  Our  accuracy  estimates  depend  on  the  dynamics  of  the  bath  that  couples  to  the  quantum 
computer,  and  can  be  expressed  either  in  terms  of  the  operator  norm  of  the  bath’s  Hamiltonian  or 
in  terms  of  the  power  spectrum  of  bath  correlations;  we  explain  in  particular  how  the  performance 
of  recursively  generated  concatenated  pulse  sequences  can  be  analyzed  from  either  viewpoint.  Our 
results  apply  to  Hamiltonian  noise  models  with  limited  spatial  correlations. 
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I.  INTRODUCTION 

Two  well-known  methods  for  protecting  quantum  sys¬ 
tems  from  noise  are  dynamical  decoupling  (DD)  and 
quantum  error  correction  (QEC).  In  DD,  pulses  are  ap¬ 
plied  to  the  protected  system,  chosen  so  that  the  dam¬ 
aging  effects  of  the  noise  nearly  average  away.  In  QEC, 
protected  logical  qubits  are  encoded  as  collective  states 
of  many  physical  qubits,  chosen  so  that  damage  due  to 
noise  can  be  detected  and  reversed. 

Each  method  has  advantages  and  disadvantages.  On 
the  plus  side,  resource  requirements  for  DD  are  relatively 

modest  [j,  i  i  i  a  a  a  a  a  0,  ill.  Only  uni¬ 
tary  control  operations  need  be  applied  to  the  system; 
there  is  no  need  to  perform  measurements  or  to  replace 
used  ancillary  qubits  with  fresh  qubits.  Furthermore,  a 
single  physical  qubit  suffices  for  each  protected  logical 
qubit,  and  protected  quantum  gates  can  be  implemented 
using  relatively  short  sequences  of  pulses.  DD  pulse  se¬ 
quences  are  simple  enough  that  experiments  on  a  wide 
variety  of  quantum  systems  have  convincingly  demon¬ 
strated  the  effectiveness  of  DD  [3,  HH,  d,  d-d,  HH.  d, 
[2TI  d-  d,  d-  HH  •  On  the  minus  side,  DD  is  effective 
only  against  low-frequency  noise,  slowly  varying  on  the 
time  scale  set  by  the  interval  between  pulses,  and  its  ef¬ 
fectiveness  is  intrinsically  limited  by  imperfections  in  the 
timing  and  shape  of  the  pulses.  Furthermore,  DD  is  not 
an  efficient  scheme  for  flushing  entropy  from  the  system, 
if  no  qubits  are  replaced  or  refreshed;  thus  it  seems  that 
DD  does  not  by  itself  provide  a  feasible  route  to  scalable 
quantum  computing. 

On  the  plus  side,  the  quantum  accuracy  threshold  the¬ 
orem  establishes  that  QEC,  through  judicious  design  of 
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fault-tolerant  gadgets  acting  on  code  blocks,  suffices  for 
accurate  simulation  of  arbitrarily  long  quantum  compu¬ 
tations,  if  the  noise  is  sufficiently  weak  and  reasonably 
local  d  d,  d,  d,  d  d  32,  d.  QEC  can  succeed 
against  high-frequency  noise,  where  DD  methods  fail. 
On  the  minus  side,  though,  the  resource  requirements 
for  QEC  are  quite  daunting.  A  ready  supply  of  fresh 
qubits  is  necessary;  furthermore,  the  number  of  physical 
system  qubits  needed  to  encode  one  logical  qubit,  and 
the  number  of  physical  gates  needed  to  execute  one  log¬ 
ical  gate,  can  be  substantial.  Because  of  the  complexity 
of  fault-tolerant  quantum  computing  protocols,  and  be¬ 
cause  these  protocols  work  only  when  the  noise  is  already 
quite  weak,  experiments  showing  that  QEC  can  suppress 
naturally  occurring  noise  have  not  yet  been  performed. 

Because  of  their  complementary  strengths,  DD  and 
QEC  used  together  should  be  more  effective  at  protect¬ 
ing  quantum  computers  from  noise  than  either  used  by 
itself.  Combining  these  two  methods  of  error  control  is 
the  topic  of  this  paper.  Hybrid  schemes  combining  DD 
with  QEC  have  been  proposed  previously  [3l:d,d|>  and 
even  studied  experimentally  [3711 .  Our  new  contribution 
is  a  systematic  investigation  of  the  advantages  of  hybrid 
schemes  for  fault-tolerant  quantum  computing,  including 
rigorous  bounds  on  performance. 

Our  main  technical  results  are  analytic  expressions  for 
the  “effective  noise  strength”  of  quantum  gates  imple¬ 
mented  using  DD  pulse  sequences.  The  effective  noise 
strength  is  (an  upper  bound  on)  the  deviation  in  the  op¬ 
erator  norm  of  the  noisy  protected  gate  from  an  ideal 
gate.  In  the  Hamiltonian  noise  models  that  we  consider, 
the  logarithm  of  the  operator  realized  by  a  DD-protected 
gate  can  be  expanded  as  a  power  series  (the  Magnus 
expansion)  in  the  noise  Hamiltonian;  we  derive  upper 
bounds  on  the  sum  of  this  series,  obtaining  formulas  for 
the  effective  noise  strength  in  terms  of  parameters  in  the 
noise  Hamiltonian.  We  find  such  bounds  both  for  general 
DD  pulse  sequences,  and  also  for  pulse  sequences  that 
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have  an  approximate  time-reversal  symmetry;  in  the  lat¬ 
ter  case  the  terms  of  even  order  in  the  Magnus  expansion 
are  heavily  suppressed. 

Armed  with  our  formulas  for  the  effective  noise 
strength,  we  can  derive  a  “noise-suppression  threshold 
condition”  on  the  noise  parameters.  When  this  condi¬ 
tion  is  satisfied,  DD-protected  gates  are  more  accurate 
than  unprotected  gates.  We  can  also  compare  fault- 
tolerant  quantum  circuits  composed  from  DD-protected 
gates  with  circuits  composed  from  unprotected  gates.  In 
either  case,  we  can  express  the  “accuracy  threshold  con¬ 
dition”  on  the  noise  parameters.  When  this  condition  is 
satisfied,  quantum  computation  is  scalable  —  accurate 
computations  of  arbitrary  size  can  be  performed  with 
a  reasonable  overhead  cost.  Typically,  improvements  in 
gate  accuracy  achieved  by  DD  mean  that  more  noise  can 
be  tolerated  by  QEC  combined  with  DD  than  by  QEC 
alone,  and  that  invoking  DD  can  reduce  the  overhead 
cost  of  QEC. 

Our  expressions  based  on  the  Magnus  expansion  for 
the  effective  noise  strength  depend  on  the  operator  norm 
of  the  Hamiltonian  that  governs  the  internal  dynamics  of 
the  quantum  computer’s  environment  (the  “bath”),  and 
the  results  are  not  useful  if  this  norm  is  large.  But  we 
also  describe  an  alternative  method  of  analysis  yielding 
expressions  for  the  effective  noise  strength  in  terms  of 
the  frequency  spectrum  of  the  bath  correlations.  Results 
derived  by  this  method  can  be  applicable  even  if  the  bath 
Hamiltonian  has  unbounded  norm,  as  long  as  the  typical 
bath  frequencies  are  sufficiently  small. 

The  performance  of  DD  can  sometimes  be  enhanced 
by  using  recursively  generated  “concatenated”  pulse  se¬ 
quences  0-  Adding  an  extra  “level”  to  the  recursive 
hierarchy  further  suppresses  the  effective  noise  Hamilto¬ 
nian,  but  at  the  cost  of  lengthening  the  pulse  sequence, 
and  the  minimal  effective  noise  strength  is  achieved  by 
choosing  the  level  that  optimizes  this  tradeoff.  We  an¬ 
alyze  concatenated  DD  sequences  and  estimate  the  op¬ 
timal  effective  noise  strength,  using  both  our  bounds  on 
the  Magnus  expansion  and  the  correlation  function  view¬ 
point. 

Our  analysis  of  the  improvement  in  gate  accuracy  that 
can  be  achieved  by  combining  DD  and  QEC  applies  only 
to  a  special  class  of  Hamiltonian  noise  models.  These 
models  satisfy  what  we  call  the  “local-bath  assump¬ 
tion”  which  limits  the  spatial  correlations  in  the  noise. 
Whether  our  results  can  be  extended  to  more  general 
noise  models  that  violate  this  assumption  is  an  intrigu¬ 
ing  open  question. 

We  formulate  our  noise  model  in  Sec.  HI1  In  Sec.HIDand 
Sec.  UV]  we  review  and  develop  some  of  the  tools  we  need 
to  analyze  the  performance  of  DD  pulse  sequences  and 
fault-tolerant  quantum  circuits  using  the  Magnus  expan¬ 
sion.  We  state  our  central  results  relating  the  effective 
noise  strength  of  DD-protected  gates  to  the  properties  of 
the  noise  Hamiltonian,  and  their  implications  concerning 
the  noise-suppression  threshold  and  accuracy  threshold, 
in  Sec.  El  then  we  apply  these  results  to  some  specific 


pulse  sequences  in  Sec.  ED  Derivations  of  these  results 
are  contained  in  Sec.  IVIII  and  the  Appendices.  We  ana¬ 
lyze  concatenated  DD  in  Sec.  IVIIII  In  Sec.  IIX1  we  em¬ 
phasize  that  the  effective  noise  can  be  related  to  intensive 
quantities  that  are  independent  of  the  spatial  volume  of 
the  bath,  and  in  Sec.  El  we  express  the  noise  strength  in 
terms  of  properties  of  bath  correlations.  Sec.  ED  contains 
our  conclusions. 


II.  NOISE  MODEL 
A.  Noise  Hamiltonian 

We  denote  by  S  the  system  consisting  of  all  of  the 
qubits  in  our  quantum  computer,  and  we  describe  the 
noise  acting  on  S  using  a  “noise  Hamiltonian”  H ,  which 
governs  the  joint  evolution  of  the  system  and  its  envi¬ 
ronment,  the  bath  B.  During  a  computation,  the  Hamil¬ 
tonian  also  contains  time-dependent  terms  that  realize 
quantum  gates  acting  on  the  qubits,  but  for  now  con¬ 
sider  the  case  where  there  are  no  gates;  then  H  may  be 
expressed  as 

H  =  Hb  +  Herr.  (1) 

Here 

Hb  =  Is  <g>  .Bo  (2) 

describes  the  “free”  evolution  of  the  bath  (how  it  would 
evolve  if  it  were  not  coupled  to  the  system)  while 

HeiI  =  Hg  +  Hsb  (3) 

includes  all  the  terms  in  H  that  act  non-trivially  on  the 
system.  The  term 

Hg  =  S0®  IB  (4) 

describes  the  unperturbed  free  evolution  of  the  system; 
Hsb  contains  terms  coupling  the  system  to  the  bath,  and 
also  perhaps  other  noise  terms  that  act  nontrivially  only 
on  the  system. 

Though  for  some  purposes  it  may  seem  natural  to 
transform  away  Hg  by  working  in  the  interaction  pic¬ 
ture  (that  is,  by  considering  the  motion  of  the  system 
relative  to  the  rotating  frame  determined  by  Hg),  we 
have  included  Hg  in  the  term  Herr  that  represents  the 
noise  acting  on  the  system.  Our  reason  is  that  the  DD 
sequences  we  study  are  designed  to  remove  the  effects 
of  all  “always-on”  terms  in  the  Hamiltonian  that  act  on 
the  system,  i.e.,  not  just  Hsb  but  also  the  free  evolution 
term  Hg.  We  may  exclude  from  the  noise  Hamiltonian 
H  a  term  proportional  to  Is  <8>  Is,  which  just  shifts  the 
zero  point  of  the  energy  and  has  no  dynamical  effect. 
Alternatively,  we  can  choose  the  coefficient  of  this  term 
to  optimize  our  estimate  of  the  accuracy  threshold;  for 
example,  we  may  by  convention  choose  H  to  be  traceless. 
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Now  consider  modeling  the  noise  during  a  nontrivial 
quantum  computation.  A  computation  is  a  circuit  con¬ 
taining  three  types  of  operations:  qubit  state  prepara¬ 
tions,  unitary  quantum  gates,  and  qubit  measurements. 
We  model  a  noisy  preparation  as  an  ideal  preparation 
followed  by  evolution  according  to  H  for  a  specified  time 
interval,  and  we  model  noisy  measurements  as  ideal  mea¬ 
surements  preceded  by  evolution  according  to  H.  We  as¬ 
sume  that  quantum  gates  are  executed  using  short,  hard 
pulses,  where,  as  in  some  experiments,  the  time  interval 
between  consecutive  pulses  is  much  longer  than  the  pulse 
width.  Each  pulse  has  its  support  in  a  narrow  interval  of 
width  6,  and  we  denote  by  tq  the  sum  of  the  pulse  width 
and  the  pulse  interval  (see  Fig.  [[]),  where  6  <C  r0.  To  be 
concrete,  we  will  sometimes  assume  that  the  pulses  are 
perfectly  “rectangular”  —  i.e .,  have  vanishing  rise-time 
and  fall-time.  However,  the  details  of  the  pulse  shape  are 
not  really  used  in  our  analysis;  rather,  all  that  matters  is 
that  the  pulse  is  confined  to  a  narrow  interval  (and  even 
this  assumption  will  be  relaxed  in  Sec. I VI  Cl).  We  use  the 
same  noise  Hamiltonian  H  to  describe  the  noise  both  dur¬ 
ing  a  pulse  and  during  the  interval  between  pulses.  We 
neglect  errors  in  the  timing  and  strength  of  the  pulses; 
these  are  typically  small  in  practice  because  the  pulses 
are  controlled  by  accurate  classical  circuitry. 


pulse  width 


pulse  interval 

FIG.  1:  Parameters  characterizing  a  sequence  of  uniformly 
spaced  rectangular  pulses:  <5  is  the  pulse  width,  and  to  —  S  is 
the  interval  between  the  end  of  one  pulse  and  the  beginning 
of  the  following  pulse. 


B.  Local-bath  assumption 

To  further  simplify  our  analysis,  we  make  an  additional 
assumption  about  the  noise,  which  we  call  the  local-bath 
assumption  [3(j,  illustrated  in  Fig.  [2]  Let  us  use  the  term 
“location”  to  speak  of  an  operation  in  a  quantum  circuit 
that  is  performed  in  a  single  time  step  —  a  location  may 
be  a  single-qubit  or  multi-qubit  gate,  a  qubit  preparation 
step,  a  qubit  measurement,  or  the  identity  operation  in 
the  case  of  a  qubit  that  is  idle  during  the  time  step.  Each 
time  step  has  duration  to'  thus  to  =  Ntq  if  N  equally 
spaced  pulses  are  applied  at  a  particular  location.  For  a 
specified  location  labeled  by  a,  let  Qa  denote  the  set  of 
qubits  that  participate  in  the  operation  applied  at  that 
location  (for  example,  a  pair  of  qubits  if  the  operation  is 
a  two-qubit  gate).  Under  the  local-bath  assumption,  the 


noise  Hamiltonian  can  be  expressed  as 

h  =  J2h^  (5) 

a 

where  the  sum  is  over  all  locations  occurring  at  a  partic¬ 
ular  time  step,  and  where  for  any  two  distinct  locations 
a  and  b  in  that  time  step,  Ha  and  Hi,  act  not  only  on 
disjoint  sets  of  system  qubits  but  also  on  distinct  baths. 
That  is,  we  may  write 

Ha  =  Hs,a  +  Berr,a;  (6) 

with 

H B ,a  =  Is, a  0  -Bo, a, 

Hprr.  a  —  E  0  Ba>a,  (7) 

a 

where  the  operators  Saja  act  on  Qa,  and  where,  for 
a  ytz  b,  the  bath  operators  Boa  and  Baa  associated  with 
location  a  commute  with  the  bath  operators  Bqi,  and 
Ba  f,  associated  with  location  b.  Each  Ha  is  assumed  to 
be  time-independent  during  the  duration  of  location  a 
(this  assumption  is  helpful  because  DD  pulse  sequences 
are  typically  designed  to  cope  with  a  time-independent 
noise  Hamiltonian),  but  Hamiltonians  at  different  loca¬ 
tions  need  not  be  the  same. 

The  local-bath  assumption  allows  us  express  the  time 
evolution  operator  for  a  single  time  step  as  a  product 
of  unitary  operators,  each  associated  with  one  particular 
location,  and  to  analyze  the  effectiveness  of  the  DD  pulse 
sequence  for  each  location  separately.  Though  our  local- 
bath  model  may  be  a  reasonable  approximation  to  the 
noise  in  actual  systems,  interactions  between  qubits  (and 
their  associated  baths)  at  different  locations  are  surely 
present  at  some  level,  and  in  Sec.  IIXI  we  will  comment 
further  on  how  our  analysis  is  affected  when  the  local- 
bath  assumption  is  relaxed. 


G, 

g2 

- - : 

g5 

G6 

<A 

: — m — 

G8 

t0  time 


FIG.  2:  Illustration  of  the  local-bath  assumption.  Blue  lines 
are  system  qubits  and  grey  lines  are  bath  subsystems.  Each 
blue  rectangle  is  a  quantum  gate,  and  its  associated  light  grey 
rectangle  represents  the  joint  evolution  of  system  qubits  and 
bath  subsystems  that  are  coupled  by  the  noise  Hamiltonian. 
The  light  grey  rectangles  do  not  overlap,  indicating  that  gates 
executed  in  parallel  act  not  only  on  distinct  system  qubits  but 
also  on  distinct  bath  subsystems. 
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C.  Noise  parameters 

To  characterize  the  noise  strength,  it  is  useful  to  intro¬ 
duce  the  parameters  /3,  J,  e  defined  by: 


P  =  max  || Hb, a ||, 

a 

(8) 

J  =  max  1 1  Hq rr  a  1 1  ? 

a 

(9) 

6  =  (3  +  J  >  max  \  \Ha  || . 

(10) 

a 


The  norm  here  is  the  sup  operator  norm 


Pll  =  sup 

{]«>} 


PIPI 

linn 


(ii) 


where  the  vector  norm  is  the  Euclidean  norm  |||u)||  = 
y/ Actually,  our  results  concerning  the  effective¬ 
ness  of  DD  pulse  sequences  apply  for  any  norm  that  is 
unitarily-invariant  (and  therefore  also  submultiplicative 
0),  but  the  operator  norm  will  be  used  to  relate  these 
results  to  the  accuracy  threshold  for  fault-tolerant  quan¬ 
tum  computing  [30,  [3lj .  We  are  typically  interested  in 
the  case  where  the  noise  is  weak,  in  the  sense  that  the  di¬ 
mensionless  parameter  ero  is  small  compared  to  one  (and 
hence  also  Jt0  <C  1  and  /3tq  -C  1).  We  will  derive  bounds 
on  the  performance  of  DD-protected  quantum  gates  ex¬ 
pressed  in  terms  of  these  small  quantities,  and  also  in 
terms  of  the  dimensionless  pulse  width  S/tq  <C  1. 

For  our  analysis  of  fault-tolerant  circuits,  we  will  find 
it  convenient  to  assume  that  measurements  and  prepara¬ 
tions  are  at  least  as  fast  as  pulses,  i.e.,  can  be  executed 
in  time  at  most  8.  But  in  Sec.  IV  Dl  wc  will  discuss  how 
to  interpret  our  results  if  measurements  or  preparations 
take  much  longer  than  pulses. 


III.  TOOLS 

Let  us  next  introduce  some  tools  for  analyzing  the  noise 
suppression  arising  from  DD  techniques.  We  focus  here 
on  the  foundations  of  our  analysis  based  on  the  Magnus 
expansion;  further  background,  needed  for  our  analysis 
based  on  bath  correlation  functions,  will  be  discussed 
in  Sec.  m  We  also  provide  here  a  brief  discussion  of 
fault  tolerance,  including  the  notion  of  the  effective  noise 
strength  at  a  circuit  location,  a  central  quantity  in  our 
analysis. 


and  bath  evolve  according  to  H  +  Hc(t).  (In  our  noise 
model,  we  assume  that  the  same  noise  Hamiltonian  H 
applies  during  a  pulse  as  between  pulses,  while  recogniz¬ 
ing  that  this  assumption  is  really  an  idealization.)  The 
DD  sequence  can  be  described  using  either  Hc(t)  itself  or 
using  the  time  evolution  operator  Uc{t)  =  Uc(t,  0)  gener¬ 
ated  by  Hc(t). 

For  understanding  the  effects  of  the  control  Hamilto¬ 
nian,  it  is  convenient  to  use  the  interaction  picture  de¬ 
fined  by  Hc  (t) ,  also  known  as  the  toggling  frame  [l| .  The 
toggling- frame  density  operator  pss(t)  is  related  to  the 
Schrodinger-picture  density  operator  pss(t)  by 

PSB  if)  =  U(t,  0)/9SB(0)/7t(f,  0) 

=  Uc(t)pss(t)UKt ),  (12) 

where  U (t,  0)  is  the  evolution  operator  generated  by  the 
full  Hamiltonian  H+Hc(t).  Therefore  the  toggling-frame 
state  evolves  according  to 

PSB(t)  =  U(t,0)pSB(0)U\t,0),  (13) 

where  the  toggling-frame  time  evolution  operator 

U(t,0)  =  UUt)U(t,0)  (14) 

is  generated  by  the  toggling-frame  Hamiltonian 

m  =  U\{t)HUc{t).  (15) 

Since  Uc(t )  acts  nontrivially  only  on  the  system,  H(t) 
can  be  written  as 

H(t)=HB  +  HeTI(t),  (16) 

where  =  U%(t)HeiIUc(t)  is  the  toggling-frame  ver¬ 

sion  of  HelI.  Because  the  operator  norm  is  unitarily- 
invariant,  we  have  ||l/(t)||  =  ||H||  <  e  and  ||Herr(i)||  = 
||  Herr  ||  <  J- 

We  consider  cyclic  DD ,  where  Uc(t)  returns  to  the  iden¬ 
tity  (up  to  a  possible  irrelevant  overall  phase)  at  the  end 
of  a  cycle  taking  time  tun- 

Uc(tDD)  =  Uc(0)  =  I.  (17) 

Therefore,  at  the  end  of  the  cycle,  the  toggling-frame  and 
Schrodinger-picture  states  coincide. 


B.  Finite-width  pulses 


A.  Toggling  frame 

For  now,  disregard  that  we  want  to  do  computation, 
and  focus  instead  on  quantum  storage  —  the  original 
context  for  DD  methods.  In  the  absence  of  any  ex¬ 
ternal  control,  the  system  and  bath  evolve  under  the 
time-independent  noise  Hamiltonian  H.  A  DD  pulse  se¬ 
quence  is  realized  via  a  time-dependent  control  Hamilto¬ 
nian  Hc(t)  acting  only  on  the  system  so  that  the  system 


In  DD,  the  system  is  controlled  using  a  sequence  of 
pulses,  where  the  control  Hamiltonian  Hc[t)  vanishes  in 
between  the  pulses.  The  control  unitary  resulting  from  a 
sequence  of  R  pulses  can  be  expressed  as 

Uc  =  lPRlPR-1l...P2lP1l.  (18) 

where  P*,  is  the  unitary  achieved  by  the  fcth  pulse.  We 
have  inserted  the  identity  I  between  successive  pulses  to 
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indicate  the  time  intervals  during  which  Hc(t )  =  0.  For 
some  pulse  sequences,  including  the  ones  described  in 
Sec.  IVI1  all  pulse  intervals  have  the  same  duration,  but 
for  most  of  our  analysis  (excluding  some  of  the  discussion 
of  pulse- width  effects  in  Sec.lVII  ATI)  we  need  not  assume 
that  the  pulses  are  uniformly  spaced.  (It  is  known  that 
the  effectiveness  of  DD  can  sometimes  be  improved  by 
varying  the  spacing  between  pulses  [H,  [U,  [Hi,  S3,  EH. 
0.) 

If  the  pulses  are  rectangular  with  width  6,  then  we  may 
write 


Pk  =  exp  (~i6HPk),  (19) 

where  Hpk  is  the  time-independent  control  Hamiltonian 
that  is  turned  on  during  the  £;th  pulse.  If  the  fcth  pulse 
begins  at  time  sk,  then  the  control  unitary  during  the 
pulse  (t  G  [sk,sk  +  5))  is 

Uc(t)  =  exp  (—iAkHPk)  Uc(sk ) 

=  exp  (—iAkHPk)  Pk~i  ■  •  •  P2Pi,  (20) 

where  A*,  =  t  —  sk.  The  toggling-frame  Hamiltonian  H{t) 
can  be  written  as 


H{t)  =  U\{t)HUc{t) 


(21) 


iAkH 


(fe-i) 


fj(k-l)  15 


for  t  G  [Sfc_i  +6,Sk), 
for  t  G  [sk,sk  +  <5), 


where 


H{k~ i)  =  PM  ■  ■  ■  Pl-iHPk-i  ■  ■  ■  P‘2  Pi  ■  (22) 


time-dependent  Hamiltonian  Hm  (t)  is  equivalent  to  time 
evolution  generated  by  the  time-independent  effective 
Hamiltonian  He g-  =  ^fl(T). 

The  leading  terms  in  the  Magnus  expansion  are  (see 
for  example,  [441]  1 


fiiCT)  =  -*  [  ds  Hm(s),  (26) 

J  0 

1  rT  rsi 

^2  (?)  =  --  ds  1/  ds2[HM{si),HM(S2)},  (27) 

2  Jo  Jo 

rS2 

ds2  /  ds3  (28) 


n3(r) 


/  0 

rT 


ds  1 


(  [Hm(si),  [ HM(s2 ),  HM(S3)}} 
[Hm(s3),  [Hm(s2),Hm(si)]]  ). 


Higher-order  terms  can  be  computed  using  a  recursive 
formula;  see  Sec.lVIIland  Appendix lAl  In  general,  H„(T) 
is  the  time  integral  of  a  sum  of  ( n  —  l)-nested  commu¬ 
tators,  each  with  n  factors  of  Hm(J).  The  Magnus  ex¬ 
pansion  is  thus  an  infinite  series  in  HmT ;  a  sufficient 
condition  for  convergence  is  [H] 


dt  \\HM{t)\\  <  7 r. 


(29) 


For  cyclic  DD,  we  can  use  the  Magnus  expansion 
to  compute  the  toggling-frame  time  evolution  operator 
U(tr>D,0)  f°r  one  complete  cycle,  where  HM{t )  is  the 
toggling-frame  Hamiltonian  H[t)  =  U%(t)HUc(t)  and 
Uc(tDi))  =  I-  In  first  order  we  obtain 


In  the  case  of  cyclic  DD,  after  the  last  pulse  of  a  complete 
cycle  we  have  Uc  =  I  and  H  =  H . 


C.  Magnus  expansion 


^i(£dd) 


dt  H(t ) 


dt  Ul(t)HUc{t). 


(30) 


For  a  given  H  (t) ,  the  toggling- frame  time  evolution  op¬ 
erator  U (fpo ,  0)  can  be  computed  using  a  Magnus  expan¬ 
sion  0].  For  a  unitary  time  evolution  operator  Um^i  0) 
satisfying  the  Schrodinger  equation 

i%-UM(t,0)  =  HM(t)UM(t,  0),  UM( 0,0)  =  I,  (23) 
at 

determined  by  Hamiltonian  Hm{J ),  the  Magnus  expan¬ 
sion  at  time  T  is  an  operator  series 

OO 

fi(T)  =  0„(T)  (24) 

n—  1 

such  that 

Um(T,  0)  =  exp  [O(T)] ,  (25) 

and  is  nth  order  in  the  Hamiltonian  HMit).  Thus, 
for  the  fixed  time  T,  time  evolution  generated  by  the 


For  group-based  DD  schemes,  like  the  examples  we  will 
discuss  in  Sec.  EH  the  integral  Eq.  (l30l)  averages  H  over  a 
finite  group  Q  if  the  pulses  are  ideal,  projecting  H  into  the 
commutant  of  Q  Q.  If  Q  acts  irreducibly  on  the  system 
Hilbert  space,  the  commutant  contains  only  the  identity 
operator  acting  on  the  system,  and  therefore  fli(fDD) 
acts  nontrivially  only  on  the  bath.  In  that  case  we  say 
that  Oi(£dd)  is  a  “pure  bath”  term. 

We  say  that  a  DD  pulse  sequence  achieves  first-order 
decoupling  if  the  first-order  term  in  the  Magnus  expan¬ 
sion  for  /7c(£dd)  acts  trivially  on  the  system.  More 
generally,  the  sequence  achieves  mth-order  decoupling  if 
lln(fDD)  is  a  pure  bath  term  for  each  n  <  m.  In  our 
analysis  we  will  at  first  consider  pulse  sequences  that 
achieve  first-order  decoupling  for  ideal  zero-width  pulses 
(later  we  will  discuss  the  corrections  to  first-order  de¬ 
coupling  that  arise  when  the  pulses  have  nonzero  width, 
and  we  will  also  describe  “Eulerian”  pulse  sequences  that 
achieve  first-order  decoupling  even  when  pulse  widths  are 
nonzero).  In  particular,  these  pulse  sequences  have  the 
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property 

pt-DD 

/  dt  Herryo(t)  =  0,  (31) 

Jo 

where  the  subscript  “0”  on  Heirio(t)  indicates  that  the 
toggling-frame  Hamiltonian  HelI(t)  is  considered  in  the 
limit  6  — >  0,  while  holding  tq  fixed.  (Eq.  (1311)  implies 
that  there  is  no  term  in  HelTt q  that  acts  nontrivially  on 
the  system  and  commutes  with  Hc[t)  for  all  t  G  [O^dd]; 
such  terms  would  not  be  removed  by  the  DD  sequence  de¬ 
scribed  by  Hc{t).)  For  pulse  sequences  satisfying  Eq.  (1311) 
it  follows  from  Eq.  m  that  the  first-order  term  in  the 
Magnus  expansion  is 

/’tDD 

^i(£dd)  =  -i/  dt  H0(t) 

J  o 

rtr>r> 

=  -iHst-DB  -i  dt  HeTT,o{t) 

Jo 

=  —iHstuD,  (32) 

a  pure  bath  term,  when  S  =  0.  For  pulses  with  nonzero 
width  6,  first-order  decoupling  is  not  exact,  but  the  de¬ 
viation  of  Hi  (£dd)  from  a  pure  bath  term  is  0{5/tq)  and 
thus  small  when  the  pulses  are  sufficiently  narrow.  For 
suitably  designed  pulse  sequences  the  deviation  can  be 
improved  to  a  higher  power  of  8/tq  00. 

A  pulse  sequence  that  achieves  first-order  decoupling 
will  also  achieve  second-order  decoupling  if  H  is  time- 
symmetric:  i?(fDD  —  t)  =  H(t)  for  t  G  [0,  £dd]  ■  This 
condition  is  satisfied  provided 

Uc(tn-D-t)  =  VtUc(t),  (33) 

where  Vt  is  unitary  and  commutes  with  H  (for  example,  if 
Vt  =  e'^'I  is  a  phase).  In  fact,  when  H  is  time-symmetric, 
not  just  the  second-order  term,  but  all  even  terms  in  the 
Magnus  expansion  vanish,  as  we  show  in  Appendix  [Bl 

D.  Quantum  accuracy  threshold  theorem 

The  quantum  accuracy  threshold  theorem  establishes 
that  a  noisy  quantum  computer  can  operate  reliably  if 
the  noise  is  sufficiently  weak.  Under  the  local-bath  as¬ 
sumption  formulated  in  Sec.  El  the  operation  applied  at 
location  a  in  the  noisy  circuit  is  a  unitary  transforma¬ 
tion  Ga  acting  on  the  system  and  bath,  which  can  be 
expressed  as 

Ga  —  Qa  +  Ba-  (34) 

Here  Qa  is  the  “good  part”  of  the  operation;  it  can  be 
expressed  as  Ga®  Ba,  where  Ga  is  the  ideal  operation 
that  would  be  applied  to  the  system  in  the  absence  of 
noise,  and  Ba  is  a  unitary  transformation  acting  on  the 
bath.  The  operator  Ba  is  the  “bad  part,”  the  deviation 
of  Ga  from  the  ideal  operation,  which  acts  jointly  on 


system  and  bath.  (Recall  that  we  model  a  noisy  qubit 
preparation  as  an  ideal  preparation  followed  by  a  noisy 
unitary  transformation,  and  a  noisy  qubit  measurement 
as  a  noisy  unitary  transformation  followed  by  an  ideal 
measurement;  for  preparation  or  measurement  locations, 
Ga  denotes  the  noisy  transformation  that  follows  or  pre¬ 
cedes  the  ideal  preparation  or  measurement.)  In  this 
noise  model,  we  may  characterize  the  noise  strength  by 

fj  =  max  ||Ba||,  (35) 

a 

the  maximum  value  of  the  operator  norm  of  the  bad  part, 
where  the  maximum  is  with  respect  to  all  locations  in  the 
noisy  circuit.  The  threshold  theorem  asserts  that  an  ideal 
circuit  of  arbitrary  size  can  be  simulated  accurately  if  fj 
is  less  than  a  critical  value  rjo,  the  accuracy  threshold. 
The  threshold  theorem  proved  in  [El  actually  applies  to 
a  broader  class  of  noise  models  that  do  not  necessarily 
satisfy  the  local-bath  assumption,  but  this  class  includes 
the  noise  model  of  Sec.  El  as  a  special  case.  The  anal¬ 
ysis  in  [47]  established  a  lower  bound  on  the  accuracy 
threshold,  770  >  10”4. 

In  this  paper  we  will  relate  the  noise  strength  fj  defined 
by  Eq.  (l34l)  and  Eq.  |35l)  to  the  parameters  that  charac¬ 
terize  the  noise  model  defined  in  Sec.  El  We  denote  by 
?7dd  the  value  of  fj  that  can  be  achieved  using  dynamical 
decoupling,  and  we  denote  by  77  the  value  of  fj  achieved 
without  using  dynamical  decoupling.  If  ?7dd  <  Vi  then  we 
say  that  the  noise  is  below  the  noise  suppression  thresh- 
old,  meaning  that  dynamical  decoupling  reduces  the  ef¬ 
fective  noise  strength.  If  77  <  770,  then  we  say  the  noise 
is  below  the  accuracy  threshold,  meaning  that  scalable 
quantum  computing  is  possible. 

In  Sec.  El  we  express  ?7dd  in  terms  of  the  parameters  J 
and  e  defined  in  Eq.  m-m-  In  Sec.  El  we  express  77DD 
in  terms  of  properties  of  bath  correlation  functions,  using 
a  different  formula  than  Eq.  (|35D  which  is  explained  in 
Appendix  [j] 

IV.  DD-PROTECTED  GATES 
A.  Including  the  gate  pulse 

So  far  we  have  described  how  to  reduce  the  noise  in 
a  quantum  memory  using  cyclic  DD.  Now  we  want  to 
estimate  the  effective  noise  strength  achieved  by  DD  for 
operations  other  than  the  identity,  so  we  must  explain 
how  DD  is  used  to  suppress  the  noise  in  these  nontrivial 
operations,  We  will  describe  nontrivial  quantum  gates, 
postponing  discussion  of  preparations  and  measurements 
until  later. 

We  refer  to  one  cycle  of  the  DD  pulse  sequence  for  the 
identity  gate  as  the  “memory”  sequence.  To  perform  a 
DD-protected  nontrivial  gate  G0,  we  must  modify  the 
memory  sequence  accordingly.  In  fact  our  DD  pulse  se¬ 
quence  for  the  gate  is  exactly  the  same  as  the  memory 
sequence,  except  for  the  very  last  pulse.  If  the  memory  se¬ 
quence  of  R  pulses  ends  with  a  period  of  trivial  evolution, 
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then  we  append  a  pulse  implementing  Ga  to  the  end  the 
memory  sequence.  Thus,  if  the  memory  sequence  lasts 
time  t dd  and  the  pulse  width  is  6,  then  the  Ga  pulse  se¬ 
quence  lasts  time  to  =  tw  +  S  and  uses  N  =  R+l  pulses. 
If  on  the  other  hand  the  i?-pulse  memory  sequence  ends 
with  a  nontrivial  pulse  implementing  P,  then  we  combine 
this  pulse  and  the  gate  pulse  into  a  single  pulse  imple¬ 
menting  GaP.  Again,  we  denote  the  total  time  for  the 
Ga  pulse  sequence  by  to,  and  the  total  number  of  pulses 
by  N(=  R). 

While  we  assume  for  simplicity  that  every  pulse  has 
the  same  width  6,  we  recognize  that  in  some  cases  differ¬ 
ent  types  of  pulses  may  have  different  time  scales.  For 
example,  in  recent  experiments  with  quantum  dot  qubits, 
X  gates  are  implemented  using  (fast)  exchange  couplings 
and  Z  gates  are  implemented  using  (slow)  magnetic  field 
gradients  E3-  One  may  interpret  S  as  the  duration  of  the 
longest  pulse  used,  or  one  could  easily  refine  our  analysis 
by  allowing  different  pulses  to  have  different  widths. 

In  a  DD-protected  circuit,  each  Ga  gate  is  replaced  by 
the  corresponding  DD-protected  gate;  under  the  local- 
bath  assumption,  the  noisy  DD-protected  gate  is  a  uni¬ 
tary  transformation  denoted  Ga  acting  jointly  on  the  sys¬ 
tems  and  the  bath.  Though  the  duration  to  of  a  DD- 
protected  gate  is  longer  than  the  duration  tq  of  an  un¬ 
protected  gate,  the  DD-protected  gate  may  be  more  ac¬ 
curate  than  the  unprotected  gate,  if  the  noise  is  weak 
enough. 

At  the  end  of  the  complete  Ga  pulse  sequence,  the  uni¬ 
tary  operator  UCja(to)  generated  by  the  control  Hamilto¬ 
nian  Hc(t)  (which  now  includes  the  gate  pulse)  is 

Uc,a(to)  =  GaUc{tDD)  =  Ga,  (36) 

because  the  cyclic  memory  sequence  satisfies  Uc(tw)  =  I 
(up  to  a  possible  phase).  Therefore  the  noisy  DD- 
protected  gate  at  location  a  is 

Ga  =  Ua(to,0)  =  Uc,a(to)Ua(to,0)  =  GaUa(t0,0),  (37) 


where  Ua(to,0)  is  the  toggling-frame  time  evolution  op¬ 
erator.  The  corresponding  toggling-frame  Hamiltonian  is 
similar  to  the  toggling-frame  Hamiltonian  Eq.  (Oil)  for  the 
memory  sequence,  except  for  the  appended  gate  pulse: 


Hait)  =  Ula{t)HaUc,a(t ) 


(38) 


iAuH 


(fc-i) 


Ht 


(k-1)  -iAkH^~1'> 


(k) 


H, 

eiAR+1H £>  jj(R) e-zAR+1H^ 

GlHaGa 


for  t  G  [sfe,sfe  +  5), 
for  t  G  [sfc  +  5,  Sfc+i), 

for  t  G  [sR+1,  sR+i  +  (5), 
for  t  =  to- 


Eq.  (|38l)  applies  to  the  case  where  the  gate  pulse  is  ap¬ 
pended  to  the  end  of  the  memory  sequence;  the  mem¬ 
ory  sequence  contains  R  equally  spaced  pulses  labeled  by 
k  =  1,  2, . . . ,  R,  and  the  gate  pulse  begins  at  time  Sfl+i- 
The  DD-protected  qubit  measurement  is  the  memory 
pulse  sequence  followed  by  an  ideal  measurement.  We 
assume  that  the  measurement  takes  time  <5,  the  same  as 


the  pulse  width,  so  that  the  duration  to  of  the  protected 
measurement  matches  the  duration  of  the  DD-protected 
gate.  Similarly,  the  DD-protected  qubit  preparation  is 
an  ideal  preparation  lasting  time  S  followed  by  the  mem¬ 
ory  pulse  sequence.  See  Sec.  IV  Dl  for  discussion  of  how 
our  analysis  is  modified  when  preparations  and  measure¬ 
ments  are  slow  compared  to  other  operations. 


B.  Effective  noise  strength 


To  define  the  effective  noise  strength  for  the  DD- 
protected  gate,  we  divide  the  noisy  gate  into  a  good  part 
and  a  bad  part  as  in  Eq.  (l34l).  obtaining 

Ga  =  GaUB,a(to)  +  Ga  —  GaUB,a(to)  (39) 

=  Ca  =Ba 


The  good  part  Qa  describes  the  ideal  evolution  in  the 
absence  of  noise  (HelI  =  0)  —  the  ideal  gate  Ga  is  ap¬ 
plied  to  the  system,  while  the  bath  evolves  according  to 
its  unperturbed  Hamiltonian  HBa.  The  bad  part  Ba  de¬ 
scribes  the  effects  of  noise,  as  modified  by  the  DD  pulse 
sequence. 

Using  Eq.  (l37l)  and  the  unitary  invariance  of  the  oper¬ 
ator  norm,  we  obtain  an  expression  for  the  noise  strength 
of  the  DD-protected  circuit: 


?7dd  =  max  ||Ba|| 

a 


max 

a 


Ua[to ,  0) 


UB,a(to)  ; 


(40) 


this  is  just  the  norm  of  the  bad  part  expressed  in  the  tog¬ 
gling  frame.  In  what  follows,  we  will  sometimes  drop  the 
subscript  a  when  context  makes  the  intended  meaning 
clear. 

We  can  now  estimate  ?7dd  using  the  Magnus  expansion. 
We  write 


U{t0, 0)  =  exp[H(f0)]  =  exp[-it0HeS],  (41) 
where  i7eff  =  ^S2(fo),  and  fl(fo)  can  be  computed  using 
the  (gate-appended)  toggling- frame  Hamiltonian  H  (t)  in 
Eq.  (l38l).  To  bound  the  quantity  \\U(to,  0)  —  UB(to)\\,  we 
make  use  of  Lemma [3]  in  Appendix  [Cl  which  gives: 

\\U(to,0)-UB(to)\\  <t0\\HeS-HB\\.  (42) 

Inserting  the  Magnus  expansion  —itoHeff  =  fl(to)  = 
E)T=i  Mu  (to)  we  find 

iJdd  <  max  \\Heff  a  -  HB,a  || 

a 

oo 

=  max  ||  V  nn  a(to)  +  itoHB,a\\ 

a  L ' 
n—1 

oo 

<  max(||H/l  o(fo)||  +  J2  llJWto)||),  (43) 

n— 2 

where  a(t)  =  12i,0(t)  +  itHBta.  For  a  pulse  sequence 
that  achieves  first-order  decoupling  with  ideal  zero-width 
pulses,  Qj  O(to)  vanishes  in  the  limit  <5  — >  0.  To  derive  a 
useful  upper  bound  on  the  effective  noise  strength  i/dd, 
we  will  need  good  bounds  on  the  other  terms  in  Eq.  (1431). 


C.  The  effective  noise  strength  for  a 
time-symmetric  sequence 


We  say  that  the  memory  pulse  sequence  is  time- 
symmetric  (or  “palindromic”)  if  H(tr>T>  —  t)  =  H(t )  for 
t  G  [0,  £dd] •  We  will  show  in  Appendix  iBl  that  a  time- 
symmetric  pulse  sequence  that  achieves  first-order  decou¬ 
pling  also  achieves  second-order  decoupling.  However, 
the  time  symmetry  is  broken  if  we  construct  the  DD- 
protected  gate  by  appending  the  gate  pulse  to  the  mem¬ 
ory  sequence,  even  if  the  memory  sequence  by  itself  is 
time-symmetric. 

For  the  purpose  of  estimating  the  effective  noise 
strength,  we  can  nearly  restore  the  time  symmetry  of 
the  DD-protected  gate  by  a  simple  trick  (see  Fig.  [3]). 
Recalling  that  our  goal  is  to  derive  an  upper  bound  on 
|| C/ (to , 0)  —  UB(t0)\\,  we  observe  that  the  unitary  invari¬ 
ance  of  the  operator  norm  implies 

\\U(to,0)-UB(to)\\  =  \\U(to,0)Ul(S)-UB{to-S)\\,  (44) 


where  6  is  the  width  of  the  gate  pulse,  and  to  =  £dd  +  8. 
Furthermore,  we  may  regard  (/(to,  0)Ug(8)  as  the  time 
evolution  operator  generated  by  the  Hamiltonian 


HM(t )  = 


-Hb  t  G  [0, 5), 
H(t  —  8)  f  G  [S,T], 


(45) 


where  T  =  to  +  5.  If  the  memory  sequence  is  time- 
symmetric,  then  Hm(T  —  t)  =  Hm^P)  for  t  G  [<5, T  —  <5]. 
Thus  HM(t)  is  “nearly  time-symmetric”  in  the  interval 
[0,  T\;  the  symmetry  is  broken  only  in  the  small  intervals 
[0,  <5]  and  [T  —  S ,  T]  at  the  beginning  and  end  of  [0,  T\. 

The  unitary  operator  U(to,0)Ug(5)  =  exp[H(T)]  can 
be  computed  using  the  Magnus  expansion  for  Hamilto¬ 
nian  HM(t).  Viewing  [7(fo,  0){7jj(<5)  as  being  generated 
by  the  time-independent  Hamiltonian  ifl(T)/(to  —  S )  for 
time  to  —  5,  and  again  using  Lemma  [3]  in  Appendix  ICl  we 
obtain  instead  of  Eq.  (031), 


77dd  =  max  \\Ua(to,  0)Ug  a(S)  -  UBya(t0  -  <5)|| 

a  ’ 

<  max ||f2a(T)  +  i(T  —  28)HB  a\\ 

a 

oo 

=  max  (llfi^Cmi  +  £  ||H„,q(T)||)  ,  (46) 


where  a(T)  is  now  defined  as  a(T)  =  Hi  a(T)  + 
i(T-28)HB,a. 

More  generally,  we  say  that  the  Hamiltonian  f?M(t) 
is  nearly  time-symmetric  in  [0,T]  if  the  time  symmetry 
holds  everywhere  except  in  a  small  interval  or  the  disjoint 
union  of  several  small  intervals.  We  denote  by  A  the 
region  in  which  the  time  symmetry  is  violated;  thus 

/  Hm(T  -  t)  =  HM(t )  for  t£  A  .  , 

\  Hm(T  -t)yt  HM(t)  for  t  G  A. 

We  also  use  the  same  symbol  A(<C  T)  to  denote  the  total 
length  of  this  region.  Thus  A  =  0  for  a  perfectly  time- 
symmetric  sequence.  In  what  follows,  we  will  sometimes 


say  that  the  pulse  sequence  realizing  a  DD-protected 
gate  is  “time-symmetric”  if  the  corresponding  memory 
sequence  is  time-symmetric,  even  though  the  time  sym¬ 
metry  may  be  broken  by  the  gate  pulse  appended  to  the 
memory  sequence.  We  say  that  the  memory  sequence 
and  the  DD-protected  gates  are  “general”  if  the  memory 
sequence  has  no  special  time  symmetry  properties.  We 
note,  though,  that  even  in  the  general  case,  we  may  say 
that  a  pulse  sequence  with  N  pulses,  each  of  duration  <5, 
is  nearly  time-symmetric  with  A  =  2 N8,  since  of  course 
Hm(T  —  t)  =  HM(t ),  if  the  control  Hamiltonian  vanishes 
at  time  t  and  at  time  T  —  t. 


V.  EFFECTIVE  NOISE  STRENGTH  AND 
THRESHOLD  CONDITIONS 

In  this  section,  we  state  some  of  our  conclusions  con¬ 
cerning  the  effective  noise  strength  ?7dd  achieved  by 
dynamical  decoupling,  and  the  implications  for  fault- 
tolerant  quantum  computing.  Derivations  will  be  post¬ 
poned  until  Sec.lVIIl  Here  we  focus  on  results  derived  us¬ 
ing  the  Magnus  expansion;  results  relating  ?7dd  to  proper¬ 
ties  of  bath  correlation  functions  are  discussed  in  Sec.  s 


A.  Bounds  on  Magnus  expansion 

Combining  Eq.  (1151)  and  Eq.  (1151).  we  can  state  our 
upper  bound  on  the  effective  noise  strength  rjun  as 

OO 

r?DD<||«i(r)ll+Ell^(T)H’  (48) 

n-2 

where  Q[(T)  =  rii(T)+i(T—2T)HB,  T  =  to+T,  and  the 
maximum  over  all  locations  is  understood.  The  Magnus 
expansion  H(T)  =  Y^=i^n(T)  is  computed  using  the 
Hamiltonian 


M  (t)  =  !  ~Hn  *e[0,r), 

M[  >  -\H(t-T)  =  HB  +  HeiI(t - r)  t  G  [r,T], 

(49) 

For  the  general  case,  in  which  we  are  not  trying  to  exploit 
the  time  symmetry  of  the  memory  sequence,  we  choose 
r  =  0.  For  the  nearly-time-symmetric  case  we  choose 
r  =  8,  and  Hm  is  time-symmetric  in  the  interval  [5,  T—S]. 

If  the  memory  sequence  achieves  first-order  decoupling 
in  the  limit  8  — >  0,  then  ||H,1(T)||  vanishes  apart  from 
finite-width  corrections.  The  nth-order  Magnus  term 
H„(T)  for  n  >  2  satisfies  ||H„(T)||  =  0((eT)n),  because 
||.ffM(i)||  <  e,  and  the  integral  fln(T)  can  be  bounded 
by  the  volume  of  the  integration  region  times  an  up¬ 
per  bound  on  the  integrand.  In  fact,  this  estimate  can 
be  improved  to  ||H„(T)||  =  O  (( JT)(eT)n^1) ,  because 
HM(t )  has  the  form  ±HB  +  H'(t)  where  H'(t )  is  either  0 
or  Herr(t);  therefore  \\[HM(ti),  HM(t2)}\\  =  0(Je ),  since 
||ii7(f)||  <  J  and  Hb  commutes  with  itself. 


9 


increasing 

time 


t  =  ^dd/2 


ti 

I 


t  =  0 

time  t 


t  =  *dd/2  4 
symmetry 
point  1 2 


t  —  *DD 


t  =  *dd/2  4:1 

I 


t  =  °  . .  . 

-  -4 —  time  t 


tl  +  <5  5 

|  |*  =  o 


(5/2 


-►  time  t 


t  =  t.0/ 2 


/  t 

tn/2 


time  t 

/  t  =  to 


B  <  time  t 


t2 


(a) 


i  —  ^DD 

00 


! 

t2  +  S 


-►  time  t 


t  =  T 


(c) 


FIG.  3:  Schematic  representation  of  HM{t)  for  time-symmetric  DD  pulse  sequences.  The  time  axis  is  bent  in  half,  with  time 
flowing  counterclockwise  from  the  upper  right  to  the  lower  left,  so  that  times  aligned  on  the  upper  and  lower  branches  are  related 
by  time  symmetry,  (a)  Two  pulses  (marked  as  black  boxes)  in  a  time-symmetric  memory  sequence  with  H(tnD  —  t)  =  H(t); 
the  pulse  on  the  bottom  branch  is  the  time-reversed  version  of  the  pulse  on  the  top  branch,  (b)  Appending  the  gate  pulse  (box 
G )  to  the  memory  sequence  spoils  the  time  symmetry;  the  black  pulses  on  the  upper  and  lower  branches  are  no  longer  aligned, 
(c)  Appending  fictitious  time  evolution  governed  by  —Hb  during  t  £  [0,  5]  (box  B)  restores  the  time  symmetry  of  the  memory 
sequence  for  t  £  [<5,  T  —  <5],  where  T  =  to  +  5. 


We  anticipate,  then,  that  at  any  location  a,  the  terms 
in  the  Magnus  expansion  can  be  bounded  as 

||ni(r)[|  <  Cr(JT);  (50a) 

||fi»(T)||  <  Cn(JT)(eT)n~\  n  =  2,3,4;  (50b) 

OO 

J2\\nn(T)\\  <  C5(JT)(eT)\  (50c) 

n= 5 

where  C±,  C2,  C3,  C4,  C5  are  constants.  Note  that  the  last 
of  these  results  bounds  the  sum  of  all  high-order  Magnus 
terms  with  n  >  5.  Combining  Eq.  (H51)  and  Eq.  (1501)  we 
find 

5 

)JDD  <  (JT)  J2  Cu(eT)n-\  (51) 

n—1 

The  constants  Cn,  derived  in  Sec.lVIIl  are  listed  in  Table 
U  for  both  general  and  time-symmetric  pulse  sequences. 
Our  value  of  C5,  obtained  by  bounding  an  infinite  series, 
holds  only  for  eT  <  0.54,  a  condition  likely  to  be  satisfied 
when  DD  works  effectively.  If  desired,  tighter  bounds 
can  be  derived  on  the  nth  order  terms  with  n  >  5  using 
results  from  Sec.  IVIII  However,  we  judge  Eq.  (]5()c|)  to 
be  good  enough  for  our  purposes,  since  this  bound  on 
the  sum  of  higher-order  terms  is  already  rather  small  for 
eT  -C  1,  as  in  typical  cases  of  interest. 

Our  bound  on  fli(T)  is  not  improved  by  invoking  time 
symmetry,  but  for  n  =  2,  3,4,  the  bounds  listed  in  Table 
Uare  tighter  for  the  time-symmetric  case  than  the  general 
case,  assuming  A /T  <C  1.  In  fact,  C2  and  C4  vanish  in 
the  limit  A/T  — >  0,  reflecting  the  property  that  all  even- 
order  terms  in  the  Magnus  expansion  vanish  when  the 
time  symmetry  is  exact.  For  the  time-symmetric  case, 
we  derive  bounds  on  Cn  for  even  n  >  6  in  Appendix  iDl 
but  these  results  were  not  used  in  our  estimate  of  C5. 


General 

Nearly  time-symmetric 

Cl 

2 NS/T  in  general, 

N5/T  if  pulses  are  regularly  spaced  in  time, 

c2 

1 

4(#)  (!-5t) 

c3 

4/9 

(2/9){l  +  15(A)(l-MA)} 

CA 

11/9 

56  (A)  (1-1  A) 

c5 

9.43 

TABLE  I:  Constants  { Cn }  appearing  in  the  upper  bound 
Eq.  (1501)  on  the  Magnus  expansion,  for  the  general  case  and 
the  nearly-time-symmetric  case.  N  denotes  the  total  num¬ 
ber  of  pulses  in  the  DD-protected  gate,  <5  is  the  pulse  width, 
T  =  to  in  the  general  case,  and  T  =  to  +  <5  in  the  nearly- 
time-symmetric  case.  For  the  nearly-time-symmetric  case,  A 
is  the  size  of  the  small  region  in  which  the  time  symmetry  is 
violated. 

B.  Noise  suppression  threshold 

DD-protected  gates  will  outperform  unprotected  gates 
if  the  noise  is  weak  enough.  In  a  circuit  of  unprotected 
gates,  each  gate  is  realized  by  a  single  pulse,  where  the 
pulses  are  separated  in  time  by  the  pulse  interval  To .  For 
the  noise  model  of  Sec.  IE  the  effective  noise  strength  for 
this  computation  may  be  expressed  as  {13,  EH 

77  =  (m^x  ||Ffss,a||^  t0.  (52) 

Eq.  (|52l)  is  not  derived  using  the  Magnus  expansion; 
rather  it  follows  directly  from  Lemma  [3]  in  Appendix  [C] 
If  we  assume  that  Hg  =  0,  Eq.  (15121)  becomes 

t]  =  Jt0.  (53) 

We  say  that  the  noise  model  satisfies  the  noise  suppres¬ 
sion  threshold  condition  if  the  effective  noise  strength  can 
be  reduced  by  using  DD-protected  gates  instead,  i.e.,  if 

Vud  <  V-  (54) 
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In  our  noise  model,  this  condition  can  be  expressed  in 
terms  of  the  parameters  erg,  6/tq  and  rg/fg. 

For  example,  continuing  to  assume  that  Hg  =  0, 
suppose  in  addition  that  5/tq  is  negligible  and  eT  is 
small  enough  so  that  the  Magnus  expansion  is  well- 
approximated  by  the  lowest-order  nonzero  term.  Then, 
in  the  general  (non-time-symmetric)  case,  using  C 1  =  0 
and  (72  =  1,  we  can  approximate  ryoD  by 

)=(d^)(j-);  (55) 

we  use  the  ~  symbol  to  emphasize  that  higher  order  cor¬ 
rections  in  6/ To  and  et g  are  neglected.  Note  that  vd d  de¬ 
pends  on  the  norm  of  the  bath  Hamiltonian  ||iJg||  (which 
contributes  to  e),  while  v  does  not.  The  noise  suppres¬ 
sion  threshold  condition  rjjon  <  rj  =  Jtq  is  satisfied  for 


The  expression  Eq.  (l55l)  for  7?dd  indicates  that  to 
achieve  effective  noise  suppression  we  should  favor  short 
DD  pulse  sequences  (with  to/ To  not  too  large)  to  mini¬ 
mize  the  exposure  to  noise  during  the  DD-protected  gate. 
On  the  other  hand  Eq.  (l58l)  illustrates  that  a  longer  pulse 
sequence  can  pay  off  if  it  allows  us  to  achieve  higher- 
order  decoupling.  These  results  exemplify  a  more  general 
tradeoff  between  shorter  sequences  and  better  decoupling 
that  must  be  optimized  to  design  DD-protected  gates 
with  the  smallest  possible  effective  noise  strength.  The 
tradeoff  is  also  manifested  by  the  analysis  in  Sec.  IVIlJI  of 
concatenated  DD  pulse  sequences. 


C.  Accuracy  threshold  and  overhead  cost 


ero  < 


or 


er0  <  N  2 


(56) 


(57) 


for  a  sequence  of  N  equally  spaced  pulses.  As  the  pulse 
sequence  grows,  the  duration  f0  of  DD-protected  gates 
increases  relative  to  the  duration  tq  of  unprotected  gates, 
and  Eq.  (l56l)  imposes  a  stronger  restriction  on  e. 

In  the  limit  of  zero-width  pulses,  a  time-symmetric 
pulse  sequence  that  achieves  first-order  decoupling 
achieves  second-order  decoupling  as  well,  so  that  C\  = 
C2  =  0.  Imposing  time  symmetry  may  lengthen  the 
pulse  sequence;  we  denote  the  duration  of  a  DD-protected 
time-symmetric  gate  by  fg,  to  contrast  with  the  dura¬ 
tion  to  of  the  gate  when  the  pulse  sequence  is  not  time- 
symmetric.  In  the  time-symmetric  case,  the  effective 
noise  strength  becomes  (assuming  5  =  0  and  thus  T  =  to, 
and  using  C3  =  2/9) 


Vdb  ~  ^(JT)(eT)2 


2  /  Jtq  \  /  erg  \2 

9  \To/t'0J  \To/t'o) 


(58) 


Therefore  the  noise  suppression  threshold  condition 
?7dd  <  J tq  is  satisfied  if 


or 


(59) 


-^'T372 


(60) 


for  the  case  of  N'  equally  spaced  pulses.  Even  though 
t'0  >  to,  Eq.  (15^1)  places  a  less  stringent  condition  on  e 
than  Eq.  m.  provided  tg/to  <  (9to/2r0)1/3.  We  empha¬ 
size  again  that  Eq.  (l56l)  and  Eq.  (l59l)  are  derived  using 
lowest-order  approximations  in  an  expansion  in  5/ To  and 
e. 


(a) 


(b) 


^DD 


threshold 

- 1 - 1- 

%  n 
threshold 

H - 1 - 

1  % 


noise  strength 


noise  strength 


FIG.  4:  Two  scenarios  where  DD-protected  gates  outper¬ 
form  unprotected  gates,  (a)  Quantum  computing  is  scalable 
with  DD-protected  gates,  but  not  with  unprotected  gates, 
(b)  Quantum  computing  is  scalable  with  either  DD-protected 
gates  or  with  unprotected  gates,  but  DD  reduces  the  overhead 
cost  of  fault  tolerance. 

A  quantum  computation  unprotected  by  DD  is  scalable 
if  the  noise  strength  of  unprotected  gates  is  below  the 
accuracy  threshold,  17  <  r/0.  For  DD-protected  gates, 
the  accuracy  threshold  condition  becomes  rju d  <  Vo-  If 
the  noise  suppression  threshold  condition  is  satisfied,  so 
that  tjud  <  Vi  it  may  be  that  v  >  Vo  and  ?7dd  <  Vo', 
in  that  case,  arbitrarily  large  quantum  circuits  can  be 
simulated  accurately  with  DD-protected  gates,  but  not 
with  unprotected  gates.  This  is  illustrated  in  Fig.QJa). 

Even  when  77  <  770 ,  DD  may  reduce  the  overhead  cost  of 
fault-tolerant  quantum  computing  if  ??dd  <  V  (Fig- 01b)). 
Suppose  that  we  wish  to  simulate  an  ideal  quantum  cir¬ 
cuit  containing  L  gates.  If  our  noisy  gates  have  noise 
strength  fj ,  which  is  below  the  threshold  value  770 ,  the 
simulation  is  possible  using  L*  noisy  gates  where  |31j 

l-  fi0g(cn„L/e)Y 

T  =  (  log(W>7>  )  :  <6I) 

here  c  and  a  are  constants,  and  0  is  the  “error”  in  the 
simulation  (the  L 1  distance  between  the  ideal  probabil¬ 
ity  distribution  of  outcomes  and  the  simulated  distri¬ 
bution).  Denote  by  L*n  the  number  of  pulses  in  the 
fault-tolerant  circuit  built  from  unprotected  gates,  and 
by  LqD  the  number  of  pulses  in  the  fault-tolerant  circuit 
built  from  DD-protected  gates,  and  suppose  that  each 
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DD-protected  gate  uses  N  pulses,  while  each  unprotected 
gate  uses  a  single  pulse.  Then  the  ratio 


App  =  Nf  logfa  h)  \ 

Kn  V0g(??o/?7pp)  / 


(62) 


is  independent  of  the  size  L  of  the  simulated  circuit. 
If  using  DD  substantially  improves  the  effective  noise 
strength,  LJjD/L*n  may  be  small,  especially  if  rj  is  only 
slightly  below  the  threshold  value  rj0.  Even  though  each 
DD-protected  gates  requires  multiple  pulses,  the  total 
number  of  pulses  used  in  the  simulation  may  be  reduced, 
because  DD  improves  the  gate  accuracy. 

Of  course,  we  have  reached  this  conclusion  using  the 
local-bath  assumption,  which  allows  us  to  assign  a  well- 
defined  effective  noise  strength  to  the  DD-protected  gate. 
Furthermore  our  results  are  useful  only  if  the  Hamilto¬ 
nian  of  the  local  bath  has  finite  norm  (so  that  e  <  oo). 
However,  we  will  see  that  the  correlation  function  anal¬ 
ysis  in  Sec.  [X]  can  provide  useful  upper  bounds  on  ??dp 
even  if  e  is  infinite. 


D.  Slow  preparations  and  measurements 

Another  drawback  of  this  analysis  is  that  our  model  of 
qubit  preparations  and  measurements  may  be  unrealistic 
in  some  physical  situations.  In  our  estimates  of  the  effec¬ 
tive  noise  strength  in  a  DD-protected  quantum  computa¬ 
tion,  we  have  treated  preparations  and  measurements  like 
gates.  We  have  assumed  that  each  preparation  and  mea¬ 
surement  location  in  the  circuit,  like  each  gate  location, 
has  duration  to-  A  DD-protected  preparation  location 
consists  of  a  preparation  taking  time  S  followed  by  a  DD 
memory  sequence,  and  a  DD-protected  measurement  lo¬ 
cation  consists  of  a  DD  memory  sequence  followed  by  a 
measurement  taking  time  S.  Thus  we  have  assumed  that 
the  preparations  and  measurements  are  just  as  fast  as  the 
pulses.  In  some  physical  systems,  however,  preparations 
and  measurements  are  relatively  slow;  in  solid-state  de¬ 
vices,  for  example,  the  measurement  time  can  be  orders 
of  magnitude  longer  than  the  gate  time. 

If  the  actual  time  S  required  for  a  preparation  or  mea¬ 
surement  is  longer  than  the  pulse  width  S  but  still  short 
compared  to  the  pulse  interval  tq,  then  we  could  still  try 
to  improve  measurements  and  preparations  using  DD  se¬ 
quences.  If  it  makes  sense  to  model  the  noise  during 
a  preparation  or  measurement  as  we  have  modeled  the 
noise  in  the  pulses,  then  we  could  modify  our  analysis 
by  using  the  measurement  width  S  in  estimating  the  ef¬ 
fective  noise  strength  ?7dd  at  preparation  and  measure¬ 
ment  locations,  while  continuing  to  use  the  pulse  width 
S  in  estimating  ijdd  at  gate  locations.  But  if  S  >  To, 
or  more  generally  if  the  noise  in  preparations  and  mea¬ 
surements  is  modeled  much  differently  than  the  noise  in 
gates,  then  it  may  be  more  appropriate  to  consider  the 
preparation/measurement  noise  strength  to  be  a  separate 
parameter  in  the  analysis,  not  necessarily  related  to  the 


parameters  J  and  e  that  characterize  the  noise  Hamilto¬ 
nian  described  in  Sec.  EH  and  appear  in  the  estimate  of 
?7dd  at  gate  locations. 

Measurement  locations  might  be  much  noisier  than 
gate  locations  because  gates  can  be  improved  using  se¬ 
quences  of  fast  DD  pulses,  while  slow  measurements  can¬ 
not  be  improved  by  DD.  Or  measurements  might  be  nois¬ 
ier  than  gates  for  other  quite  different  reasons.  Previ¬ 
ous  work  has  shown  that  scalable  fault-tolerant  quantum 
computing  is  still  possible,  and  that  the  accuracy  thresh¬ 
old  is  not  much  affected,  when  measurements  are  much 
slower  than  gates  [H|.  What  deserves  further  study, 
though,  is  how  fault-tolerant  circuit  design  can  be  opti¬ 
mized  when  measurements  are  much  noisier  than  gates. 


VI.  EXAMPLES 

Now  we  will  analyze  the  effectiveness  of  several  dif¬ 
ferent  DD  pulse  sequences,  applying  the  results  from 
Sec.  IV  Al  We  adopt  a  noise  model  that  includes  only 
single-qubit  errors  acting  on  the  system;  thus  the  noise 
Hamiltonian  is 

F  =  +  (63) 

i,ot 

fj\ 

where  i  labels  the  qubits,  <ja  for  a  =  x,y,z  are  the  Pauli 
operators  acting  on  qubit  i,  and 

Hb  =  Is  <8>  B0.  (64) 

In  some  realistic  situations,  such  as  electron-spin  qubits 
interacting  with  a  nuclear  spin  bath  0,0013,  such 
single-qubit  errors  are  the  dominant  noise  in  the  system. 

In  principle,  He rr  could  also  contain  errors  that  act 
collectively  on  several  qubits  at  once;  for  example,  errors 
acting  jointly  on  two  qubits  might  be  expected  to  occur 
during  the  execution  of  a  two-qubit  gate.  Efficient  DD 
pulse  sequences  can  be  constructed  that  suppress  rnulti- 
qubit  errors  (4f|  [5Q|,  but  in  this  Section  we  limit  our 
attention  to  single-qubit  noise  and  pulse  sequences  that 
combat  it.  The  more  general  results  in  Sec. IV  Alcan  also 
be  applied  to  other  models  that  include  multi-qubit  noise 
and  to  the  corresponding  pulse  sequences  that  achieve 
first-order  decoupling  for  such  noise. 

We  will  discuss  three  different  DD  pulse  sequences  that 
can  suppress  the  single-qubit  noise.  The  first  is  the  sim¬ 
plest  DD  scheme  that  protects  against  arbitrary  single¬ 
qubit  errors.  The  second  is  a  time-symmetric  sequence 
that  achieves  second-order  decoupling  in  the  limit  of  zero- 
width  pulses.  The  third  is  the  Eulerian  DD  scheme, 
which  is  more  robust  against  pulse  errors  than  the  other 
schemes. 


A.  Universal  decoupling  sequence 

The  shortest  pulse  sequence  that  suppresses  arbitrary 
single-qubit  errors  is  called  the  “universal  decoupling  se- 
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quence”  d,  Hq|>  or  “XY-4”  in  the  NMR  literature  [5lj. 
For  this  sequence,  the  unitary  operator  generated  by  the 
control  Hamiltonian,  acting  on  a  single  qubit,  can  be  ex¬ 
pressed  as 

Uc(tDD)  =  ZIXIZIXI.  (65) 

The  notation  in  Eq.  (|65D  is  meant  to  convey  that  one  com¬ 
plete  cycle  of  the  memory  sequence  contains  four  equally 
spaced  pulses  (each  of  width  8)  that  successively  apply 


the  Pauli  operators  X ,  Z ,  X ,  Z,  where  X  =  <jx  and 
Z  =  <rz;  therefore  the  product  of  the  four  Pauli  opera¬ 
tors  is  —I.  Each  I  in  Eq.  (|65l)  represents  trivial  evolution 
during  the  pulse  interval  of  width  To  —  6.  The  total  du¬ 
ration  of  the  pulse  sequence  is  toD  =  4ro- 

This  sequence  achieves  first-order  decoupling.  In  the 
limit  of  zero-width  pulses,  the  toggling  frame  Hamilto¬ 
nian  is 


H(t)  =  Ul(t)HUc{t) 


IHI  =  HB  +  X  ®  Bx  +  Y  0  By  +  Z  ®  Bz 
XHX  =  HB  X  <S>  Bx  —  Y  0  By  —  Z  0  Bz 
YHY  =  HB  -  X  ®  Bx  +  Y  ®  By  -  Z  ®  Bz 
ZHZ  =  HB  —  X  0  Bx  —  Y  0  By  -\ -  Z  <Z>  Bz 


for  t  G  [0,  To), 
for  t  G  [t0,2t0), 
for  t  G  [2to,  3to), 
for  t  G  [3t0,  4t0), 


(66) 


and  we  find 
^i(£dd) 


dtH(t ) 


-ir0  (IHI  +  XHX  +  YHY  +  ZHZ) 


—UudHb, 


(67) 


a  pure-bath  term.  The  first-order  Magnus  term  (up  to 
the  factor  — ^dd)  is  the  Pauli-group  average  of  the  noise 
Hamiltonian  H,  which  commutes  with  any  nontrivial 
Pauli  operator  acting  on  the  system  qubit. 

In  a  DD-protected  gate,  the  final  pulse  in  the  universal 
decoupling  sequence  is  modified  by  combining  with  the 
gate  pulse.  For  a  single-qubit  gate,  the  pulse  sequence 
realizing  the  gate  G  is 


Uc(t0)  =  {GZ)1X1ZIX1,  (68) 

where  now  GZ  represents  a  single  pulse  with  duration  5 
and  to  =  tun-  In  a  two-qubit  gate,  the  universal  pulse 
sequence  is  applied  in  parallel  to  both  qubits,  except  that 
the  final  pulse  Z  ®  Z  in  the  memory  sequence  is  replaced 
by  the  two-qubit  pulse  G(Z  (g>  Z). 

To  estimate  the  effective  noise  strength  t^dd,  we  note 
that  the  total  number  of  pulses  is  IV  =  4,  and  that 
To/to  =  1/4-  From  the  bounds  in  Eq.  (15(1  and  Table 
U  (for  the  case  where  the  sequence  is  not  time  symmet¬ 
ric)  we  obtain 


t?dd  =  (4Jt0) 


—  +4eT0  +  ^(4ct0)2 

To  9 

+  ii(4eT0)3  +  9.43(4eT0)4 

y 


(69) 


where  we  have  used  C\  =  NS /to  because  the  pulses  are 
regularly  spaced  in  time.  Note  that  the  parameters  J 
and  e  include  sums  over  all  qubits  in  the  set  Qa  that 
participate  in  the  gate  at  location  a  in  the  circuit. 


FIG.  5:  Plot  of  ?Jdd/?7  versus  ero  for  the  universal  decou¬ 
pling  sequence  (Eq.  169 II  and  for  the  time-symmetric  sequence 
CEq.  17411 .  assuming  zero- width  pulses.  The  noise  strength  for 
the  DD-protected  gate  is  weaker  than  the  noise  strength  for 
the  unprotected  gate  for  ero  <  0.0511  in  the  case  of  the  uni¬ 
versal  decoupling  sequence,  and  for  eto  <  0.0403  in  the  case 
of  the  time-symmetric  sequence.  For  ero  sufficiently  small,  us¬ 
ing  the  time-symmetric  sequence  reduces  the  noise  strength 
further  than  the  universal  decoupling  sequence. 


In  Fig.  [5l  tjud/v  (where  r]  =  Jtq)  is  plotted  as  a  func¬ 
tion  of  6To ,  in  the  limit  S/tq  — >  0.  The  noise  suppression 
threshold  condition  tjud  <  V  is  satisfied  when 

£T0  <  0.0511.  (70) 


In  the  limit  ero  — >  0,  the  noise  suppression  threshold 
condition  is  satisfied  for 


(71) 
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B.  Time-symmetric  sequence 

We  can  construct  a  time-symmetric  DD  sequence  by 
composing  two  copies  of  the  universal  decoupling  se¬ 
quence  —  first  we  perform  the  sequence  in  the  forward 
direction,  and  then  run  it  backwards  in  time.  For  zero- 
width  pulses,  using  the  same  notation  as  in  Eq.  (l65l).  in 
which  I  represents  trivial  evolution  for  time  To  between 
successive  pulses,  this  sequence  can  be  expressed  as 

Uc{tDD)  =  IXIZIXIIXIZIXI,  (72) 


so  that  tq /to  =  1/8  and  T  =  to  +  S.  From  the  bounds 
in  Eq.  (15U1)  and  Tabic  U  (for  the  case  where  the  sequence 
is  nearly  time-symmetric)  we  obtain  an  estimate  of  the 
effective  noise  strength  ?7dd  of  the  DD-protected  gates; 
we  may  use  C\  =  N5/T  <  S/t0  because  the  pulses  are 
regularly  spaced  in  time. 

In  the  limit  of  zero- width  pulses  (5/t0  — ►  0),  the  effec¬ 
tive  noise  strength  becomes 


??dd  =  (8  Jt0) 


^(8er0)2  +  9.43(8er0)4  ; 


(74) 


where  we  have  combined  the  two  Z  operators  in  the 
middle  into  a  zero-width  identity  “pulse”  (not  shown  in 
Eq.  fl?2D).  The  total  duration  of  the  pulse  sequence  is 
foD  =  8ro,  twice  as  long  as  the  universal  decoupling 
sequence.  Like  the  universal  decoupling  sequence,  this 
sequence  achieves  first-order  decoupling.  In  addition,  it 
satisfies  the  time-symmetry  property  Uc(tw—t)  =  Uc(t ), 
so  that  the  toggling- frame  Hamiltonian  obeys  ii/toD  — 
t)  =  H(t),  and  thus  this  pulse  sequence  achieves  second- 
order  decoupling  as  well.  This  pulse  sequence  is  known 
in  the  NMR  literature  as  “XY-8”  [H3|. 

For  finite-width  pulses,  we  modify  our  notation  to  em¬ 
phasize  that  the  second  half  of  the  sequence  is  the  time 
reverse  of  the  first  half.  We  write 

uc(tD  D)  = 

(73) 

Now,  each  I  represents  trivial  evolution  for  time  (tq  —  S) . 
The  I5  operator  in  the  middle  represents  trivial  evolu¬ 
tion  for  time  S,  arising  from  combining  two  Z  pulses. 
It  might  seem  more  natural  to  use  125  instead,  match¬ 
ing  the  total  duration  of  two  Z  pulses  each  with  width 
S ,  but  we  choose  the  sequence  Eq.  (1731)  so  that  our  up¬ 
per  bound  on  (T),  the  dominant  Magnus  term  when 
6/ tq  is  negligible,  will  have  a  simple  form.  Since  5/tpp, 
is  small  anyway,  it  does  not  matter  much  which  of  these 
sequences  we  choose.  X^  and  Z^  represent  finite- 
width  pulses  implementing  X  and  Z.  Before  the  mid¬ 
point  of  the  sequence  at  t  =  tuu/^,  the  X  pulses  are 
executed  using  the  constant  Hamiltonian  Hpx  such  that 
X  =  exp (—iSHpx)  and  the  Z  pulse  is  executed  us¬ 
ing  Hpz  such  that  Z  =  exp (—i8Hpz),  assuming  the 
pulses  are  perfectly  rectangular.  After  the  midpoint, 
the  universal  decoupling  sequence  runs  backwards;  X 
is  executed  using  —  Hpx  and  Z  using  — Hp Thus, 
Uc(tr>D  -t)  =  Uc(t). 

Appending  the  gate  pulse  to  this  memory  sequence 
breaks  the  time  symmetry,  which  can  be  nearly  restored 
using  the  trick  explained  in  Sec.  IIV  Cl  The  region  A  in 
which  the  time  symmetry  is  violated  is  the  union  of  two 
intervals:  the  duration  of  the  gate  pulse,  and  its  image 
under  time  reversal,  during  which  evolution  is  governed 
by  the  Hamiltonian  —Hp-  Thus  A  =  25  (recall  that  we 
use  A  to  denote  both  the  region  and  its  size).  The  DD- 
protected  gate  contains  N  =  8  pulses  (seven  pulses  in 
the  memory  sequence,  including  the  identity  pulse  in  the 
middle,  plus  the  gate  pulse)  and  has  duration  to  =  8tq, 


Vdd/v  is  plotted  in  Fig.  0  The  noise  suppression  thresh¬ 
old  condition  ?7dd  <  77  is  satisfied  when 

er0  <  0.0403  (75) 

This  condition  is  more  stringent  than  for  the  universal 
decoupling  sequence,  which  is  not  surprising  since  the 
time-symmetric  sequence  is  twice  as  long.  As  Fig.  0 
illustrates,  the  time-symmetric  sequence  becomes  more 
advantageous  when  etq  is  small,  as  is  likely  to  be  the  case 
when  rjjoD  is  below  the  accuracy  threshold  770 .  In  the  limit 
ero  — >  0,  only  C\  survives,  and  we  find  ?7dd  <  877(5 / To) ; 
thus  the  noise  suppression  threshold  condition  is  satisfied 
for 


The  largest  permissible  pulse  width  is  half  as  large  as  in 
the  case  of  the  universal  decoupling  sequence  (eq.  (1711)1 
because  the  time-symmetric  sequence  is  twice  as  long. 

Using  Eq.  (l62l)  and  the  expressions  for  ?7dd  in  Eq.  (l69l) 
(with  S/t 0  =  0)  and  Eq.  (ITT)),  we  plot  in  Fig.  0the  ra¬ 
tio  LpD/L*n  versus  ero  for  both  the  universal  decoupling 
sequence  and  the  time-symmetric  sequence.  Here,  just 
to  illustrate  the  idea  that  DD  can  drastically  reduce  the 
overhead  requirements  for  fault-tolerant  quantum  com¬ 
puting,  we  have  assumed  770 / 77  =  2,  and  we  have  taken  the 
value  a  =  log2(291)  «  8.18  from  [3lJ  (291  is  the  number 
of  locations,  including  measurements  and  preparations, 
contained  in  the  fault-tolerant  CNOT  gadget  constructed 
in  Hi).  Because  the  noise  strength  for  the  unprotected 
gate  is  close  to  the  threshold  value,  and  because  ero  is  well 
below  the  noise  suppression  threshold  for  each  DD  se¬ 
quence  in  the  range  plotted,  the  reduction  in  the  number 
of  pulses  achieved  by  using  DD-protected  gates  is  sub¬ 
stantial.  Furthermore,  although  the  time-symmetric  se¬ 
quence  is  longer  than  the  universal  decoupling  sequence, 
the  time-symmetric  sequence  reduces  the  total  number 
of  pulses  more  effectively  than  the  universal  sequence,  by 
more  than  two  orders  of  magnitude  for  ero  <  10-2. 

For  some  noise  models,  the  value  of  7?dd  derived  by 
our  general  arguments  may  be  overly  pessimistic.  For 
example,  using  the  time-symmetric  sequence  Eq.  dZ2D, 
we  computed  ^(T)  for  a  single-qubit  system  coupled  to 
an  n-spin  bath  in  an  external  magnetic  field,  assuming 
an  isotropic  Heisenberg  interaction  between  the  system 
qubit  and  each  bath  spin.  The  ratio  of  the  bound  from 
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FIG.  6:  Plot  of  Udd/^uii  (Eq.  (l62l)l  versus  ero  for  the  uni¬ 
versal  decoupling  sequence  and  the  time-symmetric  sequence, 
in  the  limit  5 /to  — >  0.  We  have  assumed  770/77  =  2,  and 
have  used  the  value  a  =  log2(291)  «  8.18  appropriate  for  the 
fault-tolerant  gadget  constructions  in  [3ll]. 


Eq.  (1501)  and  Table U  to  the  actual  value  of  ||Sl3(T)||  for 
this  model  is  plotted  in  Fig.  [7]  as  a  function  of  /3tq,  for 
Hg  =  0  and  5  =  0.  The  bound  is  larger  than  the  actual 
value  by  at  least  a  factor  of  20. 


FIG.  7:  Plot  of  1 1  fl3 1 1  Bound/ 1|  1 1  Actual  versus  /Fro  for  the  time- 

symmetric  DD  sequence  Eq.  (ED).  The  noise  Hamiltonian  is 
H  =  Hb  +  HSb  ( H°s  =  0),  where  HB  =  (0/2)52**?  and 
Hsb  =  (J/4)J2a=x,y,zas  ®  (D"=i°f);  the  index  i  labels 
the  bath  spins.  Here  ||03||Bound  is  computed  using  Eq.  (1501) 
and  Table  U  (where  5  =  0  and  T  =  8to),  while  ||f23 1|  Actual  is 
computed  by  evaluating  the  integral  in  Eq.  (1281)  exactly.  The 
kinks  in  the  plots  arise  because  the  operator  norm  can  have 
a  discontinuous  first  derivative  when  eigenvalues  cross. 


C.  Eulerian  decoupling  sequences 

The  effects  of  finite  pulse  width  and  other  pulse  imper¬ 
fections  can  be  suppressed  by  using  an  “Eulerian”  mem¬ 


ory  sequence  [j].  In  Eulerian  decoupling,  the  operator 
applied  by  each  pulse  is  the  generator  of  a  finite  group, 
and  Uc(t)  traverses  an  Euler  cycle  in  the  Cayley  graph  of 
this  group.  As  a  result,  the  error  Hamiltonian  is  group 
averaged  and  first-order  decoupling  is  maintained  even 
when  the  pulses  have  (reproducible)  imperfections.  We 
will  describe  a  simple  Eulerian  memory  sequence  here; 
see  [?]  for  a  more  general  discussion. 

A  simple  Eulerian  memory  sequence  protecting  against 
single-qubit  noise  is  [l| 

Uc(tDU)  =  XIZIXIZIZIXIZIXI.  (77) 


Here  the  pulses  are  equally  spaced  in  time;  each  I  oper¬ 
ators  represents  the  same  time  interval,  and  the  spacing 
between  the  start  of  two  consecutive  pulses  is  To .  This 
sequence  looks  like  two  repetitions  of  the  universal  de¬ 
coupling  sequence,  except  that  the  X  and  Z  pulses  are 
swapped  in  the  second  repetition.  In  contrast  to  the  time- 
symmetric  sequence  Eq.  (El.  we  use  the  same  Hamilto¬ 
nian  Hpx  to  execute  each  X  pulse,  rather  than  reversing 
the  sign  of  the  Hamiltonian  during  the  second  half  of  the 
sequence;  similarly  we  use  the  same  Hamiltonian  Hpz  to 
execute  each  Z  pulse. 

Without  making  any  assumption  about  the  pulse 
widths  or  shapes  (except  for  assuming  that  all  X  pulses 
are  alike  and  that  all  Z  pulses  are  alike) ,  we  may  express 
the  unitary  evolution  operator  over  the  pulse  interval  of 
duration  tq  as  ux{t)  for  an  X  pulse  and  uz{t)  for  a  Z 
pulse.  Then,  for  t  £  [0,toD  =  8t0],  Uc(t)  becomes 


ux(t)  I 

t  £ 

[0,  T0) 

uz(t  - 

T0)X 

t  £ 

[to,2t0) 

ux(t  - 

2  r0)(iY) 

t  £ 

[2t0,3t0) 

Uz(t  - 

3  to)(-Z) 

t  £ 

[3t0,4t0) 

uz(t  - 

4r0)(— I) 

t  £ 

[4t0,5t0) 

Ux(t  - 

5t„)(-Z) 

t  £ 

[5t0,6t0) 

Uz(t  - 

6t-0)(*F) 

t  £ 

[6t0,7t0) 

uxit  - 

7  r0)X 

t  £ 

[7tq,  8tq) 

(78) 


The  first-order  Magnus  term  Oi(tDD)  can  be  expressed 
in  terms  of  effective  Hamiltonians  Hx  and  Hz,  obtained 
by  averaging  the  Hamiltonian  over  an  X  or  Z  pulse  re¬ 
spectively: 


t0Hx  = 


tqHz 


PTO 

0 


dt  u*x(t)Hux[t), 


dt  u^z(t)Huz{t). 


(79) 


Since  Ux  and  uz  act  only  on  the  system,  they  commute 
with  the  bath  Hamiltonian  H p ;  while  averaging  over  the 
pulse  alters  He rr,  it  has  no  effect  on  Hp.  Therefore  we 
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find  that 

pt  DD  _  f*DD 

Hi(£dd)  =  /  dt  H(t)  =  /  dt  Ul{t)HUc{t) 

JO  JO 

=  r0  (fl*  +  XHxX  +  YHXY  +  ZHXZ) 

+  r0  (Hz  +  XHZX  +  YHZY  +  ZHzZ ) 

=  8Hbt0-,  (80) 

thus  Hi(£dd)  is  a  pure  bath  term.  To  derive  the  last 
line  of  Eq.  (1501).  we  have  used  the  property  H  +  XHX  + 
YHY  +  ZHZ  =  AHb  (as  in  Eq.  (|H7|l).  We  conclude 
that  first-order  decoupling  is  perfectly  attained  irrespec¬ 
tive  of  the  pulse  shape,  as  long  as  the  same  ux(Z)(t)  is 
applied  for  every  X(Z)  pulse,  and  the  integrated  pulses 
are  exactly  right. 

Adding  a  gate  pulse  G,  by  combining  G  with  the  final 
X  pulse  of  the  Eulerian  memory  sequence,  introduces  an 
error  depending  on  the  width  of  the  final  pulse.  How¬ 
ever,  because  this  nonvanishing  contribution  to  fii(T) 
arises  only  from  the  final  pulse,  it  does  not  depend  on 
the  length  of  the  memory  sequence.  Other  contributions 
to  fi(T)  that  depend  on  pulse  shapes,  in  the  second  order 
of  the  Magnus  expansion  and  beyond,  are  suppressed  by 
additional  factors  of  €Tq. 

The  contributions  that  depend  on  the  pulse  shape  can 
be  further  suppressed  by  making  the  Eulerian  memory 
sequence  time-symmetric.  Consider,  for  example,  the  se¬ 
quence 

Uc(tnT>) 

=  x(--)iz(-'>ix(-hz(~-hz(-'>ix(-hz(-Hx(--h 

x  IX(+)IZ(+)IX(+)IZ(+)IZ(+)IX(+)IZ(+)Idf(+),  (81) 

where  the  control  Hamiltonian  is  chosen  such  that 

:(-)  (Idd  ~t)=  uX(.+)  ( t )  and  uZ{~)  (£dd  —  t)  =  uZ(+)  (t). 
Because  this  sequence  obeys  the  time  symmetry  condi¬ 
tion  17c(£dd  —  t)  =  Uc(t ),  the  even-order  Magnus  terms 
vanish.  Furthermore,  because  Eq.  m  is  just  two  copies 
of  the  Eulerian  sequence  Eq.  (ED,  the  first  running 
backward  in  time  and  the  second  running  forward,  the 
sequence  achieves  first-order  decoupling  for  any  pulse 
shape.  Corrections  depending  on  the  pulse  shape  en¬ 
ter  only  in  third  order  and  beyond.  Of  course,  making 
the  Eulerian  sequence  time-symmetric  (or  making  the 
time-symmetric  sequence  Eulerian)  lengthens  the  pulse 
sequence  and  so  increases  the  time  T  appearing  in  the 
Magnus  expansion.  Whether  using  this  longer  sequence 
actually  improves  the  noise  suppression  depends  on  the 
values  of  the  parameters  ero,  Stq  and  To/to ,  but  it  could 
pay  off  if  the  pulse  width  is  relatively  large.  Adding  a 
gate  pulse  to  the  time-symmetric  Eulerian  memory  se¬ 
quence  spoils  the  first-order  decoupling  and  breaks  the 
time  symmetry,  but  the  resulting  contributions  to  Hi  and 
H2  depend  only  on  the  width  of  the  final  pulse,  not  on 
the  length  of  the  pulse  sequence. 

Eulerian  DD-protected  gates  that  achieve  exact  first- 
order  decoupling  for  nonzero-width  pulses  can  be  de¬ 
vised  using  the  dynamically  corrected  gates  recently  in¬ 
troduced  in  53,  54.  This  scheme  is  based  on  the  idea 


that  distinct  gates  can  have  related  errors,  so  that  the  er¬ 
rors  cancel  for  a  suitably  constructed  pulse  sequence.  The 
errors  in  distinct  gates  can  be  similar  if  the  gates  are  con¬ 
structed  from  control  unitaries  that  traverse  similar  time- 
dependent  baths,  differing  only  by  rescaling  or  reversing 
the  time  along  the  path.  Arbitrary-order  decoupling  for 
nonzero-width  pulses  can  be  achieved  by  concatenating 
dynamically  corrected  gates  [Hj . 


VII.  DERIVATIONS 


In  this  section,  we  derive  the  coefficients  for  the  bounds 
on  the  Magnus  expansion  listed  in  Table  HI  The  Magnus 
expansion  is  computed  for  the  Hamiltonian  Hm  (t)  given 
in  Eq.  (l49l);  at  any  time  t,  I?m(£)  =  ± HB  +H'(t),  where 
H'(t)  is  either  0  or  HelI(t).  The  two  terms  in  Um(£)  are 
bounded  as  \\HB\\  <  [3  and  ||iJ'(i)||  <  J;  thus  ||.Hm(£)||  < 
P  +  J  =  e.  The  Magnus  terms  can  be  computed  from 
Hm  (t)  using  the  following  recursive  formulas  [56| ,  derived 
in  Appendix  [A] 


m 

Hi  (T) 
n„(T) 

sP(t) 

S^(t) 


—iHM(t); 

(82a) 

[  dtA(t ); 

Jo 

(82b) 

n—l  /*T 

E-f/  dtS$p(t),n>  2; 

U 3 ■ J 0 

(82c) 

[H„_i(£),  A(t)\ ; 
n-j 

(82d) 

E  |n m(£),  S'Em  (*)J  > 2  <  3  <  n 

m—1 

-1,  (82e) 

where  {Bj}  are  the  Bernoulli  numbers.  Explicit  formulas 
for  H2(T)  and  H3(T)  were  given  in  Eqs.  ED  and  J2ED- 


A.  General  case 

For  the  general  (i.e.,  not  time-symmetric)  case,  Table 
G]  gives  Gi  =  NS /T  for  regularly  spaced  pulses  or  2 NS /T 
in  general,  G2  =  1,  C3  =  4/9,  C4  =  11/9  and  G5  =  9.43. 
Now  we  derive  these  coefficients. 


1.  Bound  for  Hj 

We  assume  that  first-order  decoupling  is  attained,  so 
that  in  the  limit  of  zero- width  pulses  H  (t)  for  the  memory 
sequence  satisfies  Eq.  (1311):  f*DD  dtHerito(t)  =  0.  Recall 
that  the  subscript  “0”  on  HelI  means  we  are  to  take  <5 
to  zero  in  HelI(t)  while  holding  To  fixed.  If  a  zero- width 
gate  pulse  is  appended  to  the  memory  sequence,  then 
Herr,o(t)  in  the  DD-protected  gate  differs  from  HeTIjo(t) 
in  the  memory  sequence  only  during  the  final  instanta¬ 
neous  pulse,  and  therefore  still  integrates  to  zero.  Hence 
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the  DD-protected  gate  as  well  as  the  memory  sequence 
satisfies  fii(T)  =  — iTHB  and  ^((T)  =  0. 

When  the  pulses  have  finite  width  ( S  >0),  picks  up 
corrections  that  depend  on  S.  Noting  that  H(t)  differs 
from  Ho(t)  only  during  the  pulses,  we  write 


fii(T)  =  -i  [  dtH (t)  +  it0HB 
Jo 

pto 

=  —  i  /  dtH0(t)  +  ito Hb 
J  o 

pto  pto 


+  *  /  dfpw-ffo(i)  —  *  /  dtpWH(t) 

Jo  Jo 

rto  _  rto 

i  /  dtp^ H q  (t)  —  i  /  dtp^ H(t).  (83) 

Jo  Jo 


Here,  dfpw  indicates  integration  only  over  times  within 
the  pulses.  Now,  H0(f)  =  HB  +  He rr,o{t),  so  for  a  se¬ 
quence  with  N  pulses  (including  the  gate  pulse),  we 
have  i  f*°  dtp^H0(t)  =  iN6HB  +  i  f*°  dtpwHen,o(t)  and 
—%  J0°  dtpwH(t)  =  —iNSHB  —  i  f0°  dtp-^/HelT(J:) .  The 
two  iN5HB  terms  cancel,  and  we  are  left  with 


=  i  f  dtpw-fferr.o(i)  ~  /  dtpW-^err  (t)  ■  (84) 

Jo  Jo 

The  second  term  can  be  upper  bounded  by  NSJ.  For 
the  first  term,  Eq.  (l38l)  tells  us  that  for  5  =  0,  Ho{t)  = 
jj(fc)  _  jjg  -| -  for  t  G  [sfc,Sfc+i).  Hence,  we  have 
that  i  J*0  dtpw-£ferr,o(i)  =  iS^2kHei)  ■  Now,  the  first- 
order  decoupling  condition  can  be  written  as 


f  °  dtHe rr,0  =  V(sfc+i  -  8k)H<g  =  0.  (85) 

Jo 


Since  Hm(J)  =  ±HB  +  H’{t ),  the  integrand 
[Hm(si),  Hm(s2)\  splits  into  a  sum  of  three 
nonzero  terms:  ±[HB,  H'(s2)],  ±[H'(si),Hb]  and 

[H'(si),H'(s2)]-  These  are  respectively  bounded  by 
2 J/3,  2J/3  and  2 J2,  which  gives  for  all  si,  s2  €  [0,  T], 

\\[Hm{si),Hm{s2)]\\  <  2J(2/?  + J)  <  4J e.  (88) 

Putting  this  into  the  norm  of  Eq.  (18711  and  doing  the 
integrals,  we  get 

||n2(r)||  <  2 Je  [  ds  1  /  1  ds2{  1)  =  (JT)(eT),  (89) 

Jo  Jo 

which  gives  C2  =  1. 


3.  Bound  for  fl3 

For  fi3(T),  we  start  from  the  explicit  formula  Eq.  (1281). 
It  is  convenient  to  have  a  more  compact  notation  that  will 
also  be  useful  in  the  time-symmetric  case  later.  With 
A(t)  =  — iHM(t ),  let  us  denote  A(si)  (occurring  in  the 
integrand  of  Eq.  (E51)l  by  its  index  i,  and  let  [ijk]  = 
[H(si),  [H(sj), H(sfc)]].  Furthermore,  let  0(a;)  be  the  step 
function:  0(x  >  0)  =  1  and  <d(x  <  0)  =  0,  and  define 

Q(pq)  =  Q(tp  -  tq),  0 (pqr )  =  &(pq)0(qr).  (90) 

Then  fi3  (T)  can  be  written  compactly  as 

(T)  =  i  J  ds\  J  ds2  J  ds30(  123) 

x  ([123] +  [321]).  (91) 


If  all  the  pulses  are  regularly  spaced  in  time,  so  that 
Sfc+i  —  s*,  are  all  equal  for  all  k ,  this  condition  implies 
that  He rr  =  0.  In  this  case,  the  first  term  of  the  right- 
hand  side  of  Eq.  (l84l)  vanishes  and  H'1  (T)  is  bounded  by 
the  norm  of  the  second  term  only: 

||H'1(T)[|<7V5J=^(JT).  (86) 

Hence,  C\  =  NS/T  if  pulses  are  regularly  spaced  in  time. 
Even  if  the  pulses  are  not  regularly  spaced  in  time,  this 
value  of  C\  works  whenever  2_)fc  He rr  =  0.  Otherwise,  we 
can  still  upper  bound  the  first  term  in  Eq.  (1M1)  by  NSJ, 
so  that  ||Oi(T)||  <  2 NSJ  =  (2N5/T)(JT).  This  gives 
Ci  =  2N8/T  in  general. 


We  need  a  bound  on  the  triple  commutator  [ijk]: 

\\[ijk)\\<2e\\[jk]\\<2e(Ue)  =  8Je2  (92) 
for  any  i,  j,k.  Therefore  we  find 

||n3(r)||  <  J  (16 Je2)  [  dSlds2ds3e(123) 

Jo 


<  ^(JT)(eT)2.  (93) 

Hence,  C3  =  4/9. 


2.  Bound  for  Q2 


f.  Bounds  for  fln>4 


To  bound  fl2(T),  we  use  the  formula  Eq.  (1271)  for 

MT): 

1  rT  rsi 

^2  (T)  =  -~  dsi  ds2[HM{si),HM{s2)\.  (87) 

^  Jo  Jo 


To  bound  the  Magnus  terms  for  n  >  4,  we  use  the 
recursive  formulas  Eq.  (l82l)  and  ideas  from  [57;,  58].  In 
Appendix [Ej  we  show  that  the  Sn'*  operators  satisfy: 

II^WII  <fij)J(2et)n-\  (94) 
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for  all  n  >  2,  1  <  j  <  n  —  1,  where  the  coefficients  flP 
are  given  in  Eq.  (IE2j).  Using  this,  we  can  write  down 
bounds  for  H„> 4  as  follows: 


U  ^  I  R  I 

nn(T)\\<Y,l-r  dsll^WII 

.7=1  -1' 


1  2-4  I B 


<  -ELri^)(JT)(2eT)n"1 

n.  L — ' 


n—  j 
=  fn(JT)(4eT)n~1 , 

where  the  coefficients  /„  are  defined  as 

n— 1 

n2r‘ 


f  =  1  v  —  fU) 

Jn  n2n~1  ^  j\  Jn  ‘ 


j= 1 


(95) 


(96) 


Using  Eq.  (lM]l  and  the  recursive  formula  for  from 
Eq.  (IE2I).  one  can  show  that  / 4  =  11/576.  Then,  rU(T) 
can  be  bounded  as 


l|n4(T)||  <  ^(JT)(4eT)3, 


(97) 


so  C4  =  43 (11/ 576)  =  11/9. 

The  bounds  for  fln  for  n  >  5  can  be  gathered  together 
into  a  single  bound  by  writing 


J2\\^n(T)\\  <  (JT)(4eTp 

n= 5 


^/„(4eiy-5 

_n= 5 


(98) 


In  [Hs|,  the  {/„}  were  shown  to  be  coefficients  in  the 
power  series  expansion  of  G~1(y)  =  fnUn\  G~1(y) 

is  the  inverse  function  of 


V  =  G(s)  =  [ 
Jo 


r , 

X 

(  X\1 

/  ax 

2  +  — 

1  —  cot  —  I 

Jo 

2 

V  2/ J 

(99) 


defined  for  domain  — 27t  <  s  <  2-7T,  the  interval  over  which 
G(s )  is  monotonically  increasing.  A  self-contained  proof 
of  this  fact  is  provided  in  Appendix  [FI  We  want  to  relate 
the  expression  in  the  brackets  in  Eq.  ((Mil  to  G-1.  Define 
C  as 


C  =  G(2tt)  =  2.17374...,  G~1(()=2tt,  (100) 


and  assume  that  eT  <  0.54  so  that  4eT  <  Then, 
G_1(4eT)  <  27t  since  G(s)  is  monotonically  increasing 
over  its  domain,  and  therefore, 


Y,  fn(^T)n-5 

_n= 5 


<  E  fnC~5 

4 


(101) 


Using  fx  =  1,  f2  =  j,  h  =  P2  and  /4  =  i^wffich  can 
be  derived  from  Eq.  (Ml)  and  Eq.  (IE2I) ,  Eq.  (1011)  implies 


E/»(4eTr-5 

_n= 5 


<  0.03685...  =  C'. 


(102) 


Then, 

OO 

El!fi™(T)ll  <  G'(JT)(4eT)4.  (103) 

n= 5 

Therefore,  Cl5s44xCl'~  9.43. 

Note  that  the  condition  eT  <  0.54  is  more  stringent 
than  the  sufficient  condition  for  convergence  of  the  Mag¬ 
nus  expansion  given  in  Eq.  (291).  which  requires  eT  <  it. 
If  0.54  <  eT  <  7T,  we  need  to  use  a  different  method  to 
find  an  upper  bound  on  the  sum  of  the  high-order  Mag¬ 
nus  terms. 


B.  Time-symmetric  case 

Now  we  derive  bounds  on  the  Magnus  terms  that  ap¬ 
ply  when  the  pulse  sequence  is  time-symmetric  except 
inside  a  small  region  A;  as  before,  we  use  A  to  denote 
both  this  region  and  its  size.  As  in  Eq.  (1491) ,  we  are  inter¬ 
ested  in  the  Hamiltonian  Hm  (t)  describing  evolution  for 
time  T  governed  by  the  Hamiltonian  —Hb,  followed  by 
evolution  for  time  T  —  T  governed  by  the  toggling- frame 
Hamiltonian  of  a  DD-protected  gate.  But  our  analysis 
in  this  Section  applies  to  any  Hamiltonian  Hm  (£)  that  is 
time-symmetric  outside  region  A. 

We  bound  the  first-order  term  fl^(T)  just  as  in  the 
general  case,  since  time  symmetry  makes  no  difference  for 
this  term.  But  even  Magnus  terms  vanish  when  A  =  0, 
and  we  will  derive  explicit  A-dependent  bounds  on  ^(T) 
and  H^T’).  We  bound  ^(T)  starting  from  Eq.  (19H).  but 
exploit  the  time  symmetry  to  improve  on  the  bound  that 
applies  in  the  general  case.  We  could  also  exploit  the  time 
symmetry  to  derive  improved  bounds  on  the  higher-order 
Magnus  terms  (Qn>5 (T))-,  however  we  will  not  bother  to 
do  so.  Instead  we  use  the  same  upper  bounds  on  these 
terms  that  apply  in  the  general  case,  with  the  expectation 
that  these  bounds  are  already  quite  small  in  typical  cases 
of  interest. 


1.  Bounds  for  Q2  and  O4 

To  obtain  a  bound  on  ^(T)  for  a  nearly  time- 
symmetric  sequence,  we  note  that  the  double  time  inte¬ 
gral  in  Eq.  (1571)  can  be  split  into  four  cases:  (i)  t±,t2  ^  A, 
(ii)  ii  G  A,t2  </  A,  (iii)  t\  £  A,f2  &  A  and  (iv) 
t\ ,  f2  G  A.  The  contribution  from  Case  (i)  vanishes, 
because  Hm  (t)  is  time-symmetric  in  this  region.  The 
remaining  cases  can  be  bounded  by  first  bounding  the 
integrand  (the  commutator)  and  then  doing  the  integra¬ 
tion.  The  commutator  can  be  bounded  as  in  Eq.  (1881): 
\\[HM(ti),HM(t2)]\\  <  4Je.  Then  doing  the  integration 
gives 

||ft2(T)||  <\{4Je)  [2A(T  —  A)  +  A2] 

=  4-  (1-— ) 

T  \  2  T  ) 


(JT)(eT),  (104) 
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which  vanishes  as  expected  in  the  limit  A  — >  0.  Hence, 
C2  =  4(A/T)  (1  -  A/2 T). 

Similarly,  we  can  derive  upper  bounds  on  all  even  Mag¬ 
nus  terms  that  depend  linearly  on  A/T  to  lowest  order, 
as  discussed  in  Appendix  [DJ  Eq.  (1D7I)  gives  the  same 
bound  on  H2(T)  as  Eq.  (1104|) .  and  for  fi4(T)  yields 


||fi4(T)||  <  14(  JT)(eT)3 


1-(1 


=  14(  JT)(eT)3 


+4 


(105) 


and  note  that 
fT/  2 

0  =  /  dhdt2dt3  (0(321)  +  0(231))  [123],  (110) 

Jo 

since  [123]  is  antisymmetric  under  the  replacement  2  <->  3 
while  0(321)  +  0(231)  is  symmetric.  Thus,  we  obtain 


1  fT/2 

H3(T)  =  -  /  dhdt2dt3  (20(123)  +  20(213))  [123], 

3  Jo 

(111) 

Bounding  the  norm  of  [123]  using  Eq.  (1M1).  and  then 
changing  variables  1  2  in  the  term  with  0(213),  we 

finally  have  that 


Since  4(A/T)3  <  4(A/T)2  and  (A/T)4  >  0,  we  can 
rewrite  this  as 


||tt4(T)||  <  14(JT)(eT)3 


A 


4—  -  2 
T 


T 


T 


=  56-1  (JT)(eT)3. 


2  T 


(106) 


\\n3(T)\\  <  -  (8 Je2) 


rT/2 


dtidt2dt3Q(l23) 


t  (8je2)  (5) 


\  48  y 
(JT)(eT)2, 


(112) 


Hence,  C4  =  56(A /T)  (1  -  A/2 T). 

2.  Bound  for 

To  bound  fi3(T)  in  the  time-symmetric  case,  we  begin 
with  Eq.  m-  Let  us  first  assume  that  Hm  (t)  is  perfectly 
time-symmetric,  so  A  =  0.  In  this  case,  by  dividing  the 
interval  [0,T]  into  two  half-intervals  [0,T/2]  and  [T/2,  T] 
and  using  the  time-symmetry,  we  find  that  we  can  reduce 
the  range  of  integration  for  t3 ,  t2  and  t3  in  Eq.  (liJTl)  to 
[0,  T/2]  while  making  the  replacement 

0(123)  ->  0(123)  +  0(321)  +  0(23)  +  0(21)  (107) 

The  four  terms  in  the  replacement  arise  from  the  four 
cases:  (i)  h,t2,t3  G  [0, T/2],  (ii)  h,t2,t3  G  [T/2,T], 
(iii)  t\  G  [T/2,  T]  and  t2,t3  G  [0,T/2],  and  (iv)  ti,t2  G 
[T/2,T]  and  t3  G  [0,  T/2] .  These  are  the  only  cases  for 
which  0(123)  is  nonvanishing.  Hence,  we  have 

1  (T'2 

Sl3(T)  =  -  /  dhdhdts  ([123]  +  [321]) 
b  Jo 

x  (0(123)  +  0(321)  +  0(23)  +  0(21)) 

1  fT'2 

=  -  I  dhdhdtsl  123]  (108) 

3  Jo 

x  (0(123)  +  0(321)  +  0(23)  +  0(21)) 

where  in  the  second  equality,  we  have  changed  variables 
1  <->  3  in  the  term  containing  [321]. 

Next,  observe  that 

0(23)  =  0(123)  +  0(213)  +  0(231), 

0(21)  =  0(321)  +  0(231)  +  0(213),  (109) 


for  a  perfectly  time-symmetric  Thus  C3  =  2/9  for 

A  =  0.  Observe  that  Eq.  (I112D  has  an  additional  factor 
of  1/2  compared  to  the  corresponding  bound  (Eq.  (1931)1 
in  the  general  non-time-symmetric  case. 

If  HM(t)  is  time-symmetric  except  for  t  G  A,  we  can  do 
an  analysis  similar  to  what  we  did  for  the  even  Magnus 
terms.  Let  H'M(t )  =  HM{t)  for  t  ^  A  and  =  0 

for  t  G  A.  Then,  H'M(t)  is  perfectly  time-symmetric. 
Actually,  we  can  discard  the  A  time  period  for  which 
H'M(t)  =  0  and  glue  the  pieces  together  (shifting  the 
time  t  accordingly)  so  that  we  end  up  with  a  perfectly 
time-symmetric  iL//(t)  for  t  G  [0,T  —  A],  Then,  we  can 
rewrite  the  expression  for  H3(T)  given  in  Eq.  (1!)T1)  as 

H3(T)  =  -  /  dSlds2ds3Q(123)  ([123]  +  [321]) 

6  Jo 

=  i  J (dSlds2ds3)T\AQ(  123)  ([123]  +  [321]) 

+  \j  {dSlds2ds3)A  0(123)  ([123]  +  [321]), 

(113) 

where  (-)t\ a  denotes  the  condition  that  none  of  the  in¬ 
tegration  variables  in  the  parentheses  is  in  A,  while  (-)A 
denotes  at  least  one  of  the  integration  variables  in  the 
parentheses  is  in  A.  The  first  integral  in  Eq.  (II 131)  is  just 
H3(T  —  A)  computed  for  the  perfectly  time-symmetric 
and  hence  can  be  bounded  using  Eq.  (11121).  The 
second  integral  can  be  bounded  by  first  bounding  the 
commutators  using  Eq.  (IM1).  and  then  doing  the  time- 
integral  f  (dsids2ds3)A0(123).  This  time-integral  splits 
into  three  cases:  (i)  exactly  one  of  Si,S2,s3  is  in  A,  (ii) 
exactly  two  of  Si,S2,s3  are  in  A,  and  (iii)  all  three  of 
si,  s2,  s3  are  in  A. 
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In  case  (i),  for  si  £  A  and  s2,  S3  ^  A,  the  time-integral 
is  given  by 


VIII.  CONCATENATED  DYNAMICAL 
DECOUPLING 


J (dsi)A  J (ds2ds3)T\AQ(123) 
<  J (dsi)A  J  {ds2ds3)T\AQ(23) 

pT- A 

=  A  /  ds2dssQ(23) 

Jo 

=  ^A(T- A)2, 


A  concatenated  DD  pulse  sequence  is  a  recursively  gen¬ 
erated  sequence  with  a  self-similar  structure  [9|,  [l(|  ■  For 
example,  from  the  “level-1”  universal  pulse  sequence 

Pi  =  ZIXIZIXI  (118) 


(114) 


we  obtain  the  corresponding  “level-2”  sequence  by  re¬ 
placing  each  pulse  interval  I  in  the  level- 1  sequence  by 
the  complete  level- 1  sequence  p±,  obtaining 


where  in  going  from  the  second  to  the  third  line,  we  have 
evaluated  the  si  integral,  and  glued  the  different  pieces 
in  T\A  into  a  single  continuous  time  interval  [0, T  —  A]. 
The  time-integrals  for  the  other  possibilities  in  case  (i), 
i.e. ,  (s2  £  A,  Sr,  s3  ^  A)  and  (s3  £  A,  s1;  s2  ^  A),  can  be 
bounded  in  the  same  way. 

In  case  (ii),  for  si,s2  £  A  and  S3  ^  A,  a  similar  ar¬ 
gument  as  used  in  Eq.  (11141)  can  be  used  to  bound  the 
time- integral: 

J (dsi)A  J ( ds2)A  J (g?S3)t\aO(123) 

<(T- A)  /  dsids20(12) 

Jo 

=  Ia2(T-A).  (115) 

The  remaining  two  possibilities  in  case  (ii),  i.e.,  (si,  S3  £ 
A ,s2  ^  A)  and  (s2,s3  6  A,si  ^  A),  can  be  bounded  in 
the  same  way. 

In  case  (iii),  we  only  have  one  possibility:  si,s2,S3  £ 
A.  The  corresponding  time- integral  can  be  computed  as 


(dsi)A  j (ds2)A  J (ds3)A&(  123) 


dsids2ds30(123) 


=  -A3. 

6 

Finally,  we  have  that 

iifiscmi 

A)]  [e(T  —  A)]2  +  ^(8Je2) 

|a (T  -  A)2  +  |  A2(T  -  A)  +  A3 

=  2-{JT){eT?  (l  + 15^-15^  +  ^) 


<  \{JT){eTf[  1  +  15  4 


1  - 


14  /  A 


15  V  T 


(116) 


(117) 


where  in  the  last  line  we  have  used  A3/T3  <  A 2 /T2. 
This  yields  C3  for  the  time-symmetric  case  as  stated  in 
Table  U 


Pi  =  Zp1Xp1Zp1Xp1;  (119) 

similarly,  the  level-fc  sequence  is 

pk  =  Zpk-iXpk-iZpk-xXpk-i-  (120) 

If  the  duration  of  a  single  pulse  is  To  and  p\  is  an 
-R-pulse  sequence  that  achieves  first-order  decoupling, 
then  the  corresponding  level-fc  sequence  pk  has  dura¬ 
tion  =  Rk  To  and  achieves  fcth-order  decoupling;  i.e., 

has  effective  noise  strength  0(Jek).  Concatenated  pulse 
sequences  are  less  efficient  than  sequences  with  nonuni¬ 
form  pulse  intervals  that  achieve  fcth-order  decoupling 
with  fewer  pulses,  but  nevertheless  have  some  nice  prop¬ 
erties.  For  one  thing,  the  concatenation  method  can 
be  applied  to  enhance  any  pulse  sequence  that  achieves 
first-order  (or  higher-order)  decoupling,  while  “optimal” 
pulse  sequences  [13L  l39l  l4ll.  1591]  are  known  only  for  qubits. 
Furthermore,  concatenated  pulse  sequences  are  relatively 
robust  against  pulse  imperfections,  because  pulse  errors 
arising  at  each  level  get  suppressed  at  higher  levels.  We 
will  analyze  the  performance  of  concatenated  pulse  se¬ 
quences  in  two  ways,  first  using  the  Magnus  expansion, 
and  then  in  Sec.lXE|using  the  Dyson  expansion  and  bath 
correlation  functions. 


A.  Magnus  expansion  analysis 

The  noise  Hamiltonian  has  an  unambiguous  decompo¬ 
sition  into  two  parts:  H  =  Hg+He rr,  where  Hg  =  I'SiBq, 
Hen  =  Sa  Ba,  and  {S'q}  is  a  basis  for  the  trace¬ 
less  operators  acting  on  the  system.  For  a  level- 1  pulse 
sequence  with  duration  T ,  the  toggling-frame  time  evo¬ 
lution  operator  is  U(T)  =  exp(S2(T));  writing  f l(T)  = 
—iH^T,  we  may  regard  H W  as  the  level-1  “effective 
Hamiltonian.”  Like  H ,  has  an  unambiguous  de¬ 

composition  into  two  parts, 

u(1)  =  1 +  ffW  =  I  <g>  +  Y,  Sa  <8  flW ,  (121) 

a 

and  we  may  define  parameters  that  characterize  the  ef¬ 
fective  noise  at  level  1: 

<  /3(1),  \\Hg>\\  <  J{1\  =  0™  +  j(D. 

(122) 
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Now,  we  can  analyze  the  level-2  pulse  sequence  just  as  we 
did  the  level-1  sequence,  but  with  the  level-0  noise  Hamil¬ 
tonian  H  =  replaced  by  the  level- 1  effective  Hamil¬ 
tonian  H W.  Proceeding  in  this  way,  we  can  estimate 
properties  of  the  toggling-frame  time  evolution  operator 
U for  the  level-fc  pulse  sequence  using  the  level- (fc— 1) 
Hamiltonian  77+-1}.  At  each  level,  we  can  define  noise 
parameters  (3^k\  J^k\  and  e+(  as  in  Eq.  (I122D.  and  derive 
recursion  relations  that  relate  the  level-fc  noise  parame¬ 
ters  to  level- (fc— 1)  noise  parameters. 

To  understand  how  first-order  decoupling  is  achieved 
by  the  level- 1  sequence,  we  assumed  that  the  toggling- 
frame  Hamiltonian  is  constant  in  the  interval  between 
pulses.  For  the  concatenated  sequence  at  level  2  and 
above,  this  assumption  is  not  true,  since  the  interval  in 
between  the  level-fc  pulses  contains  a  complex  level- (fc— 1) 
pulse  sequence.  However  the  unitary  operator  describing 
the  evolution  from  the  end  of  one  level-fc  pulse  to  the 
beginning  of  the  next  level-fc  pulse  is  equivalent  to  the 
evolution  operator  that  would  have  been  derived  from 
the  constant  Hamiltonian  H^k~1'1  during  the  pulse  in¬ 
terval.  Thus  for  the  purpose  of  understanding  the  time 
evolution  in  the  toggling  frame  resulting  from  the  level-fc 
sequence,  it  does  no  harm  to  imagine  that  the  Hamilto¬ 
nian  is  constant  between  pulses  and  do  the  analysis  just 
as  for  the  level-1  sequence. 

For  a  sequence  that  achieves  first-order  decoupling,  Hi 
at  each  level  is  a  pure  bath  term 

n(fc)(T(fc))  =  (123) 

where  T ^  =  Rk  ro  is  the  duration  of  the  level-fc  sequence, 
constructed  by  concatenating  fc  times  a  sequence  with  R 
pulses.  Suppose  we  consider  a  pulse  sequence  such  that 
each  pulse  either  commutes  or  anticommutes  with  each 
of  the  traceless  operators  in  the  set  {Sq}  (the  argument 
below  can  be  easily  adapted  to  more  general  pulse  se¬ 
quences).  Under  this  assumption,  the  second-order  term 
H2  in  the  Magnus  expansion  has  no  pure-bath  compo¬ 
nent  (see  Appendix  iGl)  and  thus  contributes  only  to  He)}  ■ 
Therefore  arises  from  and  the  pure  bath  com¬ 
ponent  of  =  YlnL 3  ^"C')  ■  As  shown  in  Appendix  [HJ 
the  norm  of  the  pure-bath  component  of  H)  is  no  larger 
than  ||H^||;  it  follows  that  we  may  choose  such  that 
^(.k)j’(k)  jg  an  Upper  bound  on 

||HW(T)||  +  ||Hg(T)||.  (124) 

From  Eq.  (15011  and  Table  Q]  we  see  that 

||n^||  <  4fc)  (jO-Utw)  (e(fc-1)r(fc))2 ,  (125) 

(k) 

where  the  “constant”  C3  actually  depends  on  the  value 
0f  £(fc-l)  J ’(fc); 

cf]  =  ^  +  ¥  (e(fe_iy(fe))  +9.43  ,  (126) 


assuming  e^k  <  .54  ( e.g .,  =  0.67  for 

e(fc-i)T(fc)  =  0.1  and  c+)  =  0.46  for  e^k~^T^  =  0.01). 
Recalling  Eq.  (I123D.  we  conclude  that 

p(k)  =  p{k- 1)  +  c(fc)  j(fc-i)  ^e(fe-i)T(fe)^2  .  (f27) 

Though  Eq.  (112711  has  been  expressed  as  an  equality,  the 
right-hand  side  is  actually  an  upper  bound  on 
The  level-fc  error  Hamiltonian  arises  from 

and  the  traceless  component  of  .  It  is  shown  in  Ap¬ 
pendix  El  that  the  norm  of  the  traceless  component  of 
n{k)  is  no  larger  than  2||H^||i  therefore  we  may  choose 
jW  such  that  j(fc)T+)  is  an  upper  bound  on 

||HW(T)||+2||Hg(T)||.  (128) 


(If  the  system  is  a  single  qubit,  then  the  norm  of  the 
traceless  component  of  f is  no  larger  than  ||fi^  ||,  and 
thus  the  factor  of  2  in  the  second  term  can  be  omitted.) 
Therefore,  again  using  Eq.  (15U1)  and  Tabled] we  find 


j(fc)  =  ci2k)J(k-1)  (e^-^T^  ,  (129) 


where 


4fe)  =  1  +  2 


-(e(fe-1)T(fc))  +  —  ^e(k-1)j'(k)y 


+9.43(e(fc“1)T(fc)); 


(130) 


assuming  e^k  ^T1^)  <  .54  (e.g.,  c ^  =  1.133  for 
e(fe-i)T(fc)  =  0.1  and  c{k)  =  1.01  for  e(k-i)T(k)  =  00l). 
Eq.  (112711  can  be  rewritten  as 

p{k)  =p(k-i)  +K(k-i)  (131) 

where 

=  C(U  ^j(.k-i)T(k)\  ^e(fe-i)T(fe)^2  ;  (132) 

and  iterating  this  equation  yields 

/3(fe)  =  /?  +  AT«  +  AT(2)  +  •  •  •  +  (133) 

and 

e(k)  =  p(k)  +  j(k) 

=  /3  +  AT(1)+A'(2)  +  ---+A:(fc“1)+ J(fe). 

(1341 


The  solution  to  the  recursion  relations  Eq.  (1129H.  (113211. 
(11341)  cannot  be  expressed  easily  in  closed  form,  but  the 
properties  of  the  solution  can  be  grasped  if  we  assume 
that 


4fe)e(fc-1) 


<  ce 


(135) 
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for  each  k,  where  c  is  a  constant.  That  is,  if  we  iterate 


the  recursion  relations  to  estimate  our  assumption 

is  that  Eq.  (|135|)  is  satisfied  for  all  k  <  L  Then  using 
Tik)  =  Rkr0,  we  can  replace  Eq.  (|129|)  by 

j(fc)  =  (cero)  Rk  J{k~l) , 

(136) 

which  has  the  solution 

j(fc)  =  (ceT0)k  Rk(k+1)/2J 

(137) 

where  =  J  [6(j. 

The  effective  noise  strength  for  the  level-fc  sequence  is 

Tdd  =  \\H^\\T^  =  J^Rkr0 
=  Rk (fc+3)/2  (cer0)fc  (Jr0), 

(138) 

so  that 

VDD  ~  H  (C€To)77Dd  5 

(139) 

therefore  the  optimal  suppression  of  the  noise  strength  is 
achieved  by  choosing  the  level  k  to  be  the  largest  integer 
such  that  Rk+1(ceTo)  <  1,  or  equivalently, 


fcmax  =  Ll°gfl(1/ce'ro)  -  1J.  (140) 

where  |_- J  denotes  the  “floor”  function.  For  example, 
if  cer0  =  10-3  and  R  =  4,  we  choose  fcmax  =  3 
( i.e .,  a  sequence  with  duration  T ^  =  64ro)  and  obtain 
?7dd/(</to)  =  2.6  x  1CD4,  an  improvement  by  a  factor  of  60 
over  the  noise  strength  achieved  by  the  level- 1  sequence. 

The  expression  for  r/{^  in  Eq.  (113811  is  the  exponen¬ 
tial  of  a  quadratic  function  of  k ,  minimized  at  k  = 
log(l/cer0)  —  3/2.  The  nearest  integer  differs  from  this 
optimal  value  by  at  most  1/2;  substituting  k  +  1  = 
log(l/cero)  into  Eq.  (11381).  we  conclude  that  the  optimal 
effective  noise  strength  satisfies 

v£d)/{Jt0)  <  R -1  (cerop  los*(1/e"»>M  .  (141) 

If  we  concatenate  a  time-symmetric  pulse  sequence, 
which  achieves  second-order  decoupling,  then  we  may  re¬ 
place  Eq.  (11291)  by 

j(fe)  =  2 (142) 

(the  factor  of  2  can  be  omitted  if  the  system  is  a  qubit), 
and  we  can  also  improve  the  estimate  of  C3  to 

4fe)  =  -  +  9.43(e(fc_1)T(fe))2,  (143) 
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where  e^k~1')T^k'>  <  .54.  Defining  c  for  a  time-symmetric 
sequence  by  2 (e^fe_1^)2  <  (ce)2,  Eq.  (11421)  becomes 

J{k)  =  (cer0)2  R2kJ{k~l) ,  (144) 

which  has  the  solution 

j(fc)  =  (cer0)2fc  Rk{k+1)J,  (145) 


and  thus 

4)D  =  Rk(k+2)  &T0)2k  (Jro).  (146) 

The  noise  strength  is  optimized  by  choosing  the  largest 
integer  k  such  that  k+  |  <  logfl(l/cero).  For  example,  if 
R  =  8  and  cero  =  10-3,  we  choose  k  =  2  (i.e.,  a  sequence 
with  duration  T ^  =  64ro)  and  obtain  77DD / (Jro)  =  1.7x 
10— 5 ,  an  improvement  by  a  factor  of  30  over  the  noise 
strength  achieved  by  the  level-1  sequence.  The  optimal 
noise  strength  satisfies 

r?DDt)/(Jro)  <  R~3/ 4  (cer0)loSB(1/5eTo)-2  .  (147) 

Optimal  noise  strengths  for  the  universal  and  time- 
symmetric  sequences,  plotted  in  Fig.  [HI  are  orders  of  mag¬ 
nitude  lower  than  the  noise  strengths  achievable  without 
concatenation,  shown  in  Fig.  [5]  Though  longer,  the  time- 
symmetric  sequence  performs  much  better  when  cer 0  is 
sufficiently  small. 


logio(cer0) 


FIG.  8:  Upper  bounds  on  the  effective  noise  strengths 
achieved  by  the  concatenated  universal  DD  pulse  sequence 
(blue  dashed  line,  based  on  Eq.  (11411)  with  R  =  4)  and  by  the 
concatenated  time-symmetric  DD  pulse  sequence  (red  solid 
line,  based  on  Eq.  (11471)  with  R  =  8),  as  a  function  of  cero, 
where  c  is  defined  in  the  text. 


B.  Including  pulse  errors 

How  is  this  analysis  affected  if  the  pulses  are  imper¬ 
fect?  The  answer  depends  on  the  degree  to  which  the 
pulse  errors  are  systematic  and  reproducible,  rather  than 
random.  As  in  our  discussion  of  Eulerian  decoupling,  let 
us  assume  that  the  errors  are  systematic.  This  assump¬ 
tion  is  reasonable  if  the  pulse  errors  arise  from  the  time- 
independent  noise  Hamiltonian  that  is  “on”  during  the 
pulses,  rather  than  from  variations  in  the  pulse  shape. 

In  the  recursive  analysis  of  the  concatenated  pulse  se¬ 
quence,  the  effective  Hamiltonian  7/4fe_1)  incorporates  all 
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the  damage  caused  by  the  pulses  errors  at  level  fc  —  1  and 
below.  Because  the  pulse  errors  are  systematic,  we  may 
use  the  same  to  describe  the  noise  in  each  inter¬ 

val  between  level-fc  pulses.  Suppose  we  imagine,  at  first, 
that  while  the  pulses  at  level  k  —  1  and  below  are  noisy, 
the  pulses  at  level  k  are  ideal,  and  denote  by  J ^  the 
upper  bound  on  ||  under  this  fictitious  assumption. 

Repeating  the  derivation  of  Eq.  (11291)  yields 

J(fc)  =  4fc)J(fc“1)  (e(fe-1)r(fe))  .  (148) 


But  now  we  must  relate  J ^  to  J ^  by  estimating  the 
effects  of  the  pulse  errors  at  the  top  level. 

The  noise  in  these  level-fc  pulses  is  governed  by  the 
level-0  error  Hamiltonian  rather  than  the  effective 
level-(fc— 1)  error  Hamiltonian  H&r1'*  •  We  could  adapt 
our  analysis  of  the  Magnus  expansion  to  this  new  situ¬ 
ation,  using  a  different  upper  bound  on  Herr  during  the 
pulses  than  in  the  interval  between  pulses,  but  then  we 
would  face  the  complication  of  revising  our  estimate  of 
all  the  higher-order  terms  in  the  expansion.  To  avoid 
that  complication,  we  use  a  different  approach.  As  in 
Sec.  IVII A  11  we  assume  that  the  Hamiltonian  describing 
the  sequence  of  noisy  pulses  at  level  k  deviates  in  opera¬ 
tor  norm  from  the  Hamiltonian  describing  the  sequence 
of  ideal  pulses  at  level  k  by  at  most  2  J  during  a  total 
time  interval  RS,  if  there  are  R  pulses  each  with  width 
6.  It  then  follows  from  Lemma  [3]  in  Appendix  [Cl  that 


<  2  R5J, 


(149) 


where  f includes  pulse-error  corrections  at  all  levels 
while  includes  pulse-error  corrections  at  level  k  —  1 
and  below  but  not  at  level  k.  From  Eq.  (Jl9j)  in  Appendix 
HI  we  find  that 


n(k)  _  fj(fc) 


<  d(fc) 


<  2 dwR5J,  (150) 


where  the  “constant”  d ^  is  close  to  one  if  and 

||H(fe)||  are  both  small;  therefore  we  obtain  an  upper 
bound  on 


j(k)  <  J(fc)  +  2  _  Q(k)  /T(k) 

„WJ(fe-1)  ^e(fc-i)T(fc)j  +  4d(fc) RSJ/T^ . 


<  Co 


(151) 


If  at  each  level  the  second  term  in  Eq.  (11511)  is  small 
compared  to  the  first  term,  then  our  previous  analysis  of 
the  pulse  sequence  remains  a  good  approximation,  and 
we  conclude  that  the  pulse  errors  do  not  compromise  the 
effectiveness  of  concatenated  DD  very  much.  However, 
the  second  term  imposes  a  floor  on  (our  upper  bound  on) 
the  effective  noise  strength 

=  jWrW  >  4 d^RSJ  >  4 R5J.  (152) 


As  the  level  k  increases,  rj falls  as  in  Eq.  (11381)  as 
long  as  it  remains  well  above  the  floor,  but  reaches  a 
plateau  as  the  floor  is  approached.  Such  behavior  was  ob¬ 
served  in  the  numerical  simulations  reported  in  0.  This 
floor  might  be  substantially  suppressed  by  using  Eulerian 
pulse  sequences  as  in  Sec.  IVI  Cl 

A  noteworthy  property  of  Eq.  (11511)  is  that  only  the 
pulse  errors  at  the  top  level  appear  explicitly  on  the  right- 
hand  side.  The  errors  at  lower  levels  are  included  im¬ 
plicitly,  through  their  contributions  to  J and  . 

Accordingly,  Eq.  (I151D  captures  the  idea  that  the  cumu¬ 
lative  effect  of  the  errors  in  the  Rk  pulses  is  smaller  than 
might  have  been  naively  expected,  because  errors  that 
occur  at  lower  levels  in  the  pulse  sequence  become  sup¬ 
pressed  by  the  upper  level  pulses.  This  is  an  important 
feature  of  concatenated  DD. 


IX.  BEYOND  THE  LOCAL-BATH 
ASSUMPTION 

A  key  element  of  the  noise  model  formulated  in  Sec.  |H] 
is  the  local-bath  assumption:  at  any  given  time,  the  noise 
Hamiltonians  Ha  and  Hi,  associated  with  distinct  circuit 
locations  a  and  b  act  not  only  on  disjoint  sets  of  qubits 
but  also  on  disjoint  baths.  This  assumption  is  impor¬ 
tant  because  it  allows  us  to  ignore  interactions  among 
different  circuit  locations  and  thus  assign  an  effective 
noise  strength  r/n d  to  each  DD-protected  gate  individ¬ 
ually.  The  local-bath  assumption  may  be  a  reasonable 
approximation  to  noise  in  actual  systems,  at  least  in  some 
cases,  but  it  is  not  strictly  satisfied;  surely  there  are  bath 
degrees  of  freedom  that  couple  to  multiple  qubits,  even 
while  these  qubits  are  participating  in  distinct  gates.  Can 
our  analysis  be  extended  to  noise  models  that  include  cor¬ 
relations  that  arise  because  qubits  participating  in  dif¬ 
ferent  gates  at  the  same  time  couple  to  common  bath 
variables? 

Accuracy  threshold  theorems  have  been  proved  for 
Hamiltonian  models  of  correlated  noise  in  [3l],  [32|,  j33']. 
Perhaps  similar  methods  can  be  applied  to  DD-protected 
circuits,  but  this  seems  to  be  a  technically  challenging 
problem  which  we  leave  for  the  future. 

However,  there  is  an  easier  problem  that  already  arises 
when  we  consider  just  a  single  circuit  location,  and  disre¬ 
gard  how  the  noise  at  one  location  is  correlated  with  the 
noise  at  another  location.  How  is  our  analysis  affected 
if  the  qubits  at  this  location  couple  not  just  to  a  local 
bath  comprised  of  nearby  bath  degrees  of  freedom  but 
to  a  global  bath  that  includes  bath  variables  that  are  far 
away?  Of  course,  our  previous  analysis  still  applies  if  we 
replace  the  norm  ||iLs)Q||  of  the  local-bath  Hamiltonian 
by  the  norm  ||iLs||  of  the  global-bath  Hamiltonian  in 
Eq.  (0)  and  Eq.  (fTOl) ,  but  the  trouble  with  this  approach 
is  that  \\Hb\\  is  a  huge  number  that  scales  linearly  with 
the  volume  of  the  bath,  while  an  accuracy  threshold  cri¬ 
terion  should  be  stated  in  terms  of  intensive  quantities 
that  are  independent  of  the  size  of  the  system  and  bath. 
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On  the  other  hand,  we  expect  on  physical  grounds  that 
the  bath  has  a  decomposition  into  local  subsystems,  and 
that  the  coupling  of  a  given  bath  subsystem  to  a  sys¬ 
tem  qubit  decays  as  the  distance  increases  between  the 
bath  subsystem  and  the  qubit;  if  in  contrast  each  sys¬ 
tem  qubit  were  coupled  with  constant  strength  to  bath 
subsystems  arbitrarily  far  away,  the  noise  would  be  unac¬ 
ceptably  strong  and  coherent  manipulation  of  the  system 
would  be  hopeless.  Even  though  the  local-bath  assump¬ 
tion  formulated  in  Sec  [XT]  may  not  hold  exactly,  a  sensible 
noise  model  should  be  quasi-local  —  qubits  ought  to  in¬ 
teract  only  very  weakly  with  bath  subsystems  that  are 
far  away.  In  this  case,  can  we  express  the  effective  noise 
strength  in  terms  of  intensive  quantities? 

To  be  concrete,  consider  a  noise  model  in  which  a  single 
system  qubit  is  immersed  in  a  bath  of  Nf,  non-interacting 
spins  in  an  external  magnetic  field.  The  noise  Hamilto¬ 
nian,  assuming  H$  =  0,  is 

H  =  HB  +  HSB  =  J2HB,i  +  J2HsB’i’  (153) 

i  i 

where 

HB,i  =  Is  ®  B® , 

(154) 

Oi 

Here,  the  index  i  =  1, . . . ,  Nf,  labels  the  bath  spins  and 
{(Tq,,  a  =  1,2,3}  are  the  Pauli  operators  acting  on  the 
system  qubit.  We  may  define  the  strengths  of  the  indi¬ 
vidual  terms  as 

Ai  =  \\HSB,i\\  and  bt  =  \\HB<i\\  =  \\B^\\  ,  (155) 

and  the  strength  of  the  system-bath  coupling  can  be  char¬ 
acterized  by 

J  =  J2xi  >II^b||;  (156) 

i 

we  assume  that  the  sum  converges  to  a  (small)  finite  value 
in  the  limit  Nf,  — >  oo.  On  the  other  hand,  the  quantity 

/?  =  £>>||ffB||  (157) 

is  not  expected  to  remain  bounded  as  Nf,  — >  oo. 

Now  consider  how  the  bath  parameters  {bf}  enter  the 
Magnus  expansion  for  a  DD  memory  sequence  or  for  a 
DD-protected  gate  applied  to  the  system  qubit.  The 
Hamiltonian  Hm  (t)  is 

HM(t)  =  Hb  +  H\t)  =  Y  HB,i  +  H\t),  (158) 

where  H'(t)  =  0  or  HelT(t)(=  HsB{t ))  as  in  Sec. IV  A1  so 
that  ||ff'(f)|  <  J.  Furthermore,  bath  operators  acting 
on  different  bath  spins  commute: 

[Bl  B°]  =  [£?,  Bf]  =  [Bf,Bf]  =  0 ,  VM  3\  (159) 


the  only  nonvanishing  commutators  of  bath  operators  are 
[B^,Bf]  and  [Bf,Bj]  (for  any  spin  i). 

The  bath  parameters  {bf}  do  not  contribute  to  fli(T), 
so  consider  fl2  (T).  To  estimate  the  integral  in  Eq.  (1571) 
(taking  T  =  0  so  that  /7m(^)  =  H(t)),  we  need  an  upper 
bound  on  the  commutators.  We  observe  that 

||[77B,Ferr(S2)]||  =  ||  B ,i,  H S B ,i(s2))\\ 

i 

<  Y  2||^B,i||  •  \\HSB,i(s2)\\  <  2 bJ,  (160) 

i 

where  we  have  defined  the  single-spin  bath  parameter 

b  =  max  ||-ffB.i||  .  (161) 

i 

We  also  observe  that 

||[#err(si),-fferr(s2)]||  <  ||  Y^SB,i(si),  HSB,j  (s2)]  || 

<Y2Xixj=2j2-  (162) 

id 

Together,  Eq.  (I160H  and  Eq.  (|162|)  imply 

\\[Hm(s1)1Hm(s2)\\  <45J  +  2J2,  (163) 

and  plugging  Eq.  (11631)  into  Eq.  (1571)  yields 

||fi2(T)||  <  (46J  +  2J2)  Jr2  <  (JT)l(b  +  J)T\.  (164) 

This  result  matches  the  conclusion  we  reached  under  the 
local-bath  assumption  (see  Eq.  (1551)  and  Eq.  (l89l)l.  but 
with  P  now  replaced  by  b. 

Similarly,  upper  bounds  on  the  higher-order  Magnus 
terms  can  be  also  be  expressed  in  terms  of  J  and  b , 
though  the  “replace  P  by  b  rule”  does  not  quite  work 
beyond  second  order.  Consider,  for  example,  one  triple 
commutator  that  occurs  in  fl3  (T): 

mB,[Heir(s2),HeiI(s3)}}\\ 

=  II  Y1Hb,Ic,  [HsB,i(s2),HsB,j(s3)]]\\ 

ijk 

~  E  (lH-^B^,  [HsB,i(s2),HsB,j{S3)}}\\ 

ij 

+  11  Wbj,  [• HSB,i(s2 ),  HSB,j{s3)]} ||^ 

<  2(2 b)  Y  2 A, A,  =  8 bJ2.  (165) 

i,3 

In  contrast,  in  the  local-bath  model  we  could  upper 
bound  the  corresponding  triple  commutator  by  4/3 J2. 
Simply  replacing  P  by  b  gives  the  wrong  answer  by  a  fac¬ 
tor  of  2,  because  it  fails  to  take  into  account  that  there 
are  two  different  bath  spins  that  do  not  commute  with 
[HsB,i(s2),  HsBj(s3)]  for  i  j.  Similar  factors,  depen¬ 
dent  on  n,  occur  in  the  higher-order  nested  commutators 
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contributing  to  fl„(T),  but  these  factors  do  not  depend 
on  the  total  number  of  bath  spins  Nb-  We  could  also  in¬ 
clude  quasi-local  interactions  among  the  bath  spins,  and 
still  obtain  an  upper  bound  on  each  Magnus  term  ex¬ 
pressed  in  terms  of  intensive  quantities. 

Even  when  our  bounds  on  the  Magnus  expansion  are 
intensive,  they  might  still  be  useless,  if  each  local  bath 
subsystem  has  a  Hamiltonian  with  a  large  norm.  In  that 
case,  though,  there  is  another  method  that  might  suc¬ 
ceed,  which  relates  the  effective  noise  strength  to  the  fre¬ 
quency  spectrum  of  bath  correlations.  We  turn  to  that 
method  next. 


X.  DYNAMICAL  DECOUPLING  AND  BATH 
CORRELATIONS 


So  far,  we  have  described  how  to  analyze  the  perfor¬ 
mance  of  DD  using  the  toggling  frame  and  the  Magnus 
expansion.  Another  method  is  to  use  the  interaction  pic¬ 
ture  defined  by  Hc(t )  +  Hb)  that  is,  to  transform  away 
both  the  control  sequence  acting  on  the  system  and  the 
free  bath  dynamics.  In  that  case,  the  interaction-picture 
Hamiltonian  is 


m 


um 


Herr  [Uc(t)  ®  UB(t)} 


Y,sa(t)  ®  Ba(t),  (166) 


where 


Sa{t)  =  Uc(tVSaUc(t),  Ba{t )  =  eitB°Bae~itB° , 

(167) 

and  we  can  study  the  interaction-picture  time  evolution 
operator  using  the  Magnus  expansion  defined  by  this 
Hamiltonian.  This  expansion  has  the  big  advantage  that 
the  interaction  picture  sums  up  the  effects  of  the  free  bath 
dynamics  to  all  orders  in  /?;  therefore,  higher-order  cor¬ 
rections  are  small  provided  J  is  small,  even  though  (3  may 
be  large.  But  there  is  also  a  substantial  disadvantage:  be¬ 
cause  the  interaction  picture  bath  operator  Ba  (t)  is  now 
time  dependent,  a  pulse  sequence  that  achieves  first-order 
decoupling  in  the  toggling  frame  may  not  achieve  first- 
order  decoupling  in  the  interaction  picture. 

On  the  other  hand,  if  the  bath  operator  Ba(t)  is  in 
some  sense  slowly  varying,  then  first-order  decoupling 
might  be  satisfied  to  a  good  approximation.  Though  the 
rate  of  change  of  the  operator  Ba(t )  is  actually  of  order 
/ 3 ,  if  the  state  of  the  bath  has  suitable  properties,  then  the 
expectation  value  of  Ba  (t)  in  that  state  may  vary  slowly; 
then  DD  may  work  well  because  the  typical  frequencies 
of  the  bath  are  sufficiently  small,  even  though  (3  may  be 
large. 

When  estimating  ??dd  using  the  Magnus  expansion, 
we  did  not  make  any  assumption  about  the  state  of  the 
bath.  The  new  estimates  we  derive  in  this  Section  de¬ 
pend  on  the  bath’s  frequency  spectrum  and  hence  im¬ 
plicitly  on  the  bath’s  state.  In  order  to  obtain  a  simple 


formula  for  7?d d  we  will  impose  a  further  limitation  on 
the  noise  model  that  was  not  needed  in  the  Magnus  ex¬ 
pansion  analysis  —  we  assume  that  the  state  of  the  bath 
is  discarded  at  the  end  of  each  circuit  location,  and  re¬ 
placed  by  a  fresh  bath  state  at  the  beginning  of  the  next 
location.  Thus  we  will  include  the  effects  of  the  bath’s 
memory  in  analyzing  the  effectiveness  of  the  DD  pulse 
sequence  at  each  circuit  location,  but  we  assume  that 
noise  correlations  between  consecutive  circuit  locations 
can  be  neglected.  We  recognize  the  artificiality  of  this 
noise  model,  but  we  adopt  it  anyway  because  it  allows  us 
to  derive  an  explicit  expression  for  ?7dd-  See  Appendix  QT] 
for  further  discussion. 


A.  Dyson  expansion 


In  the  toggling  frame,  it  is  convenient  to  analyze  DD 
using  the  Magnus  expansion  because  for  a  well  chosen  se¬ 
quence  of  ideal  pulses  Hi  is  a  pure  bath  term,  and  the  re¬ 
maining  noise  acting  on  the  system  resides  in  the  higher 
order  terms.  But  if  we  use  the  interaction  picture  in¬ 
stead,  so  that  first-order  decoupling  is  not  exact  even  for 
ideal  pulses,  it  is  simpler  to  estimate  the  effective  noise 
strength  ?7dd  using  the  Dyson  expansion  rather  than  the 
Magnus  expansion.  The  interaction-picture  time  evolu¬ 


tion  operator  U(t)  =  U%(t)  ®Ug(t)  U(t,  0)  is 


U{t)  =  T  exp  (-i  dt'H(t')j  (168) 

where  T  denotes  time  ordering.  For  the  local-bath  noise 
model,  augmented  by  the  assumption  that  the  bath  is 
refreshed  at  the  beginning  at  each  circuit  location,  the 
arguments  in  Appendix  [j]  show  that  the  noise  strength  fj 
can  be  expressed  as 

?  =  ^((c'm-^t/m-1)) 

=  max/21  -U(T) -Ut(T)\;  (169) 

Q,|\t)  \  / 


here  T  is  the  duration  of  the  location,  the  expectation 
value  (•)  is  evaluated  in  the  pure  state  jT)  ®  |$a)  where 
|<f>a)  is  (a  purification  of)  the  initial  state  of  the  bath  at 
the  beginning  of  location  a,  and  the  maximum  is  with 
respect  to  all  circuit  locations  and  all  system  states.  As 
is  also  shown  in  Appendix  [j] 

fj2  <  max  J  dt\  J  dt2  ^H(ti)H(t2)^ 

+  2  (eJT  -l-JT-  i(  JT)2^  .  (170) 
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For  each  term  in  the  expansion  Eq.  (11661)  the  expectation 
value  in  the  product  state  factorizes  and  we  have 


f  < 


dtidt2  ^2  (sa(ti)Sp{t2))s  {Ba{ti)Bp(t2)) 2 

a, (3 


for  all  a  and  for  m  =  0,1,2, ...  ,n  —  1.  Denoting  the 
norm  of  the  operator  Sa<n  by  Ca^n,  we  find  that  for  a 
pulse  sequence  achieving  nth-order  decoupling,  the  noise 
strength  is 


+  2  (  eJT  —  1  —  JT  —  -(JT)2 


) ,  (171) 

7?DD  < 

Ca,nC0,nJl0^T2  (iOiT)2n 

1/2 


+  ...,  (179) 


where  the  maximum  over  circuit  locations  and  system 
states  is  implicit. 

Now  suppose  that  the  bath’s  time  correlations  are  sta¬ 
tionary,  i.e.,  that  the  expectation  value  (Ba(ti)B0(t2))  b 
is  a  function  of  the  time  difference  t\  —t2\  this  will  be  true 
if  the  initial  state  of  the  bath  commutes  with  H b  ,  for  ex¬ 
ample  if  the  state  is  a  mixture  of  energy  eigenstates  such 
as  a  thermal  state.  Then  the  bath  correlation  function 
may  be  expressed  as 


where  now  the  ellipsis  includes  corrections  both  higher 
order  in  u>T  and  higher  order  in  the  Dyson  expansion. 
Therefore,  ignoring  the  0[(JT)3)  corrections  higher  or¬ 
der  in  the  Dyson  expansion,  nth-order  decoupling  im¬ 
plies  that  DD  suppresses  the  effective  noise  strength  by  n 
powers  of  toT  where  w  is  a  characteristic  bath  frequency, 
rather  than  n  powers  of  eT  as  in  our  previous  analysis 
using  the  Magnus  expansion. 


/OO  7 

-t2)  Kap{u),  (172) 

and  Eq.  (11711)  becomes 

/OO  7 

2^  ^(<Sct(w)5/3(-w))s  Ka0(u)  +  ..., 


a, (3 


where 


Sa(w)  =  /  dt  e~iut  Sa(t), 


(173) 


(174) 


and  the  ellipsis  indicates  the  terms  higher  order  in  J . 
Defining  the  bath’s  spectral  function  J2^  i  by 

Ka0(co)  =  JlpA“  -  “>0,  (175) 

i 

our  expression  for  (the  square  of)  the  noise  strength  is 


B.  Universal  decoupling 

To  be  concrete,  consider  the  case  of  a  single  qubit  with 
noise  Hamiltonian 

H  =  I  (g>  Bo  +  ^2  &a®Ba.  (180) 

a—x,y,z 


For  a  sequence  of  ideal  zero- width  Pauli  operator  pulses, 
the  time-dependent  system  operator  in  the  interaction- 
picture  Hamiltonian  becomes 

<ra(t)  =  Fa(t)cra,  (181) 

where  Fa(t)  =  ±1  (+1  if  aa  commutes  with  Uc(t)  and 
— 1  if  a a  anticommutes  with  Uc(t)).  For  the  universal 
decoupling  sequence 


rj2  <  max  ^  J2(3  i(Sa(a;i)5/3(-Wi))s  +  •  •  •  .  (176) 

Thus,  speaking  loosely,  DD  is  effective  if  Sa(u)  is  sup¬ 
pressed  when  u  is  a  “typical  frequency”  where  the  bath 
spectral  function  has  support.  We  use  the  symbol  J2 

advisedly,  because  like  J  =  max  ||Fferr|| ,  scales 

linearly  with  the  strength  of  the  system-bath  coupling. 

The  operator  Sa(u> )  can  be  written  as  TSa(ujT),  where 
Sa  is  dimensionless.  Adapting  our  terminology  to  this 
correlation  function  analysis,  let  us  say  that  a  pulse  se¬ 
quence  achieves  nth-order  decoupling  if  the  first  n  terms 
in  the  Taylor  expansion  of  Sa(uiT)  vanish,  so  that 

Sa{w)  =  T  (Sa,n{uT)n  +  0[(uT)n+1})  .  (177) 


Uc{tDD)  =  ZIXIZIXI,  (182) 

these  functions  are 

Fx  =  (+  H - )> 

Fy  =  (H - 1 - ), 

Fz  =  (+--+);  (183) 

here,  for  example,  Fx  =  (+  4 - )  means  that  Fx  has 

the  value  +1  in  the  intervals  [0, 7o]  and  [ro,2ro]  and  has 
the  value  —1  in  the  intervals  [2t0,3t0]  and  [3to,4t0].  All 
three  functions  integrate  to  zero  over  the  interval  [0, 4ro] 
and  hence  achieve  first-order  decoupling.  Evaluating  the 
Fourier  transform 


Equivalently,  the  pulse  sequence  achieves  nth-order  de¬ 
coupling  provided 


dt  tmSa(t)  =  0 


(178) 


we  find 


Fa(ui) 


dt  e~iutFa(t), 


(184) 
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(-iw)Fx(u>) 

(-iuj)Fy(uj) 

(-iu>)F~(  w) 


(: r  -  1)(1  +  x  -  x2  -  x3)  =  -( x 2  -  l)2  =  4e-2i“T°  sin2(o;To)  =  4(wr0)2  +  . . . , 
(x  —  1)(1  —  x  +  x2  —  x 3)  =  —{x  —  l)(x4  —  l)/(x  +  1) 

2e~2iuiT0  tan(ci;To/2)  sin(2o;To)  =  2(wr0)2  +  . . . , 

(x  —  1)(1  —  x  —  x2  +  x3)  =  (x  —  l)(x2  —  l)2/(x  +  1) 

4 ie~2lUTo  tan(wro/2)  sin2(wro)  =  2*(wtq)3  +  . . . , 


(185) 


where  x  =  e~lulTa .  The  low-frequency  suppression  of 
Fy(uj)  is  stronger  by  a  factor  of  2  than  the  suppression 
of  Fx(u> )  because  the  period  of  Fy(t)  is  shorter  than  the 
period  of  Fx(t).  The  function  Fz(u> )  is  suppressed  by  a 
further  power  of  ujtq  because  Fz(t)  is  time-symmetric: 
Fz(4to  —  t)  =  Fz(t).  Indeed,  for  any  function  F(t)  satis¬ 
fying  F(T  —  t)=  F(t ),  we  have 


F(u)  =  dt  e 


-iut  =  f  dt  e-iu>t  F(j  _ 

Jo  Jo 

f  dt  e~iu(-T-t)  F(t)  =  e~iu>T F{—oj)\  (186) 
Jo 


thus  F(u>)  =  e  lUjT^‘2 FeVen  (to),  where  Feven(u>)  is  an  even 
function  of  u>,  and  F(oj)  =  0(w2)  if  F(0)  vanishes. 

The  time-symmetric  pulse  sequence 


Uc(tDD)  =  1X1ZIXWXIZ1X1,  (187) 


achieves  second-order  decoupling  because  all  three  func¬ 
tions  obey  F{t)  =  F(T  —  t): 


C.  Finite-width  pulses 

If  the  pulses  are  not  ideal,  then  first-order  decoupling 
will  not  be  exact.  For  example  if  the  pulses  have  nonzero 
width,  then  there  is  a  contribution  to  fj2  of  the  form 

[  {dtidt2)p-\tf  ^  1  (Sa{tl)S[)(t2))  g  (Ba{tl)Bfj(t2))  B 

=  ^2  ^aft!(Sa,Pw(Wi)S/3!Pw(-Wi))s;  (190) 

here  f  (dt) pw  denotes  integration  over  the  nonzero-width 
pulses,  and 

■5a,pw(w)  =  J  (dt)  pw  e~lut  Sa(t).  (191) 

For  a  sequence  of  N  pulses,  each  with  duration  S ,  we 
expect  Sa< pw(w)  «  Ar<5||S'Q||  for  uit  <C  1.  Comparing  with 
Eq.  (I177D.  we  conclude  that  for  a  pulse  sequence  that 
achieves  nth-order  decoupling  in  the  ideal  case,  pulse- 
width  corrections  are  small  provided 


Fx  —  (+  H - 1^+)) 

Fy  =  (-1 - 1 - 1 - h), 

Fz  =  (-| - 1 — | - 1-).  (188) 

Compared  to  the  four-pulse  sequence,  the  functions  Fx 
and  Fy  are  repeated  twice,  but  with  a  sign  flip,  so  the 
Fourier  transform  is  suppressed  by  an  additional  factor 
of  1  —  a;4  =  2 ie~2luJT°  sin(2wro)  «  4j(o;to).  The  function 
Fz  is  repeated  without  the  sign  flip,  so  its  Fourier  trans¬ 
form  is  multiplied  by  1  +  xA  =  2e~2lUT°  cos(2o;to)  «  2. 
Therefore  we  have 


NS/T  <  (uT)n  (192) 

where  a;  is  a  typical  bath  frequency.  This  is  similar  to  the 
criterion  we  found  using  the  Magnus  expansion,  except 
with  the  frequency  oj  now  replacing  the  operator  norm  e. 

For  an  Eulerian  sequence  with  reproducible  pulse  er¬ 
rors,  Sa(u)  vanishes  in  the  limit  ui  — >  0  (by  the  same 
reasoning  as  in  Sec.  IVI CD:  therefore  first-order  decou¬ 
pling  is  exact.  Furthermore  Sa(ui)  is  an  even  function  of 
uj  for  any  time-symmetric  pulse  sequence,  and  therefore  a 
time-symmetric  Eulerian  sequence  achieves  second-order 
decoupling. 


Fx(uj)  =  =  -16t0(o;to)2  +  . . . , 

Fy(u)  =  =  -~8t0(ujt0)2  +  . . . , 

Fz(cj)  =  =  -4to(cjt0)2  +  ■  ■  ■  ■  (189) 

Again,  different  types  of  low-frequency  Pauli  noise  are 
suppressed  by  different  (constant)  factors,  with  the  heav¬ 
iest  suppression  for  phase  ( i.e .,  <rz)  noise.  By  altering  the 
pulse  sequence,  the  stronger  suppression  could  be  applied 
to  <JX  or  Uy  noise  instead. 


D.  Gaussian  noise 

We  have  seen  that,  while  in  our  previous  analysis  we 
required  (3  =  max||ffB||  to  be  small  compared  to  1/tq 
in  order  to  get  a  useful  estimate  of  ??dd,  the  analysis 
based  on  bath  correlation  functions  can  provide  a  useful 
estimate  even  if  f3  is  large.  However  we  still  require  that 
J  =  max  1 1 /ferr  1 1  is  small  to  justify  neglecting  the  higher- 
order  corrections  in  the  Dyson  expansion  in  Eq.  (I170D .  In 
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some  cases  it  is  possible  to  go  further  and  express  these 
higher-order  corrections  in  terms  of  correlation  functions 
as  well,  thereby  obtaining  an  estimate  that  makes  sense 
even  if  the  system  qubits  are  coupled  to  bath  operators 
with  large  norm  ( e.g .,  the  quadrature  amplitudes  of  a 
bath  of  harmonic  oscillators). 

Consider,  for  example,  a  single  qubit  coupled  to  bath 
operators  whose  correlators  obey  Gaussian  statistics  in 
the  interaction  picture:  the  interaction-picture  Hamilto¬ 
nian  is 

#(*)  =  £*»  {t)®Ba(t),  (193) 

a. 

where  the  expectation  value  of  an  odd  number  of  bath 


operators  vanishes,  and  the  expectation  of  an  even  num¬ 
ber  of  bath  operators  is 

<B(l)B(2)...B(2n)> 

=  £  K(iu  h)K{i3,  i4)  ■  ■  -K&n-i,  hn)- 

contractions 

(194) 

Here  the  sum  is  over  the  ( 2n)\/2nn\  ways  to  divide  the 
labels  1,  2, ...  2 n  into  n  unordered  pairs,  and  we  use  the 
shorthand  B(i)  =  Bai(ti),  K(i,j)  =  ( B(i)B(j))B •  Thus 
terms  of  odd  order  in  the  Dyson  expansion  for  fj2  vanish, 
and  we  may  bound  the  (2n)-th  order  term  as 


f  dh...  dt2nT  (H(h)  ■  ■  ■  H(t2n ))  \  <  -J-  /  dt i  ...  £  £  \K(il,  *2)  ’  ’  '  K(i2n-1,  *2n)| 

,  '  ®  j  \  )‘  0  ai,...a2n  contractions 


(2n)! 


zsn  1  fT 

£  (2 K)n  =  —,  where  AT  =  -  J  dtds  £  |  (Ba(t)B0(s))  B  \  . 

contractions  ^  a,/3 


(195) 


To  derive  Eq.  (11951).  we  use  Eq.  (11941)  and  ||cra(t)||  =  1, 
and  we  note  that  the  value  of  \K(i,j)\  does  not  depend 
on  the  time  ordering  of  A  and  tj.  We  conclude  that, 
in  the  case  of  Gaussian  noise,  the  sum  of  all  corrections 
higher  than  quadratic  order  in  the  Dyson  expansion  can 
be  bounded  above  by 

OO 

£  Kn/n\  =  eK  -  1  —  K,  (196) 
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and  that  the  quadratic  term  provides  a  good  approxi¬ 
mation  to  the  effective  noise  strength  for  K  sufficiently 
small. 

Another  approach  to  analyzing  DD  is  to  use  the  Dyson 
expansion  and  to  also  expand  Ba(t)  in  powers  of  B^t, 
thus  obtaining  a  double  expansion  in  powers  of  JT  and 
(JT .  In  that  case  we  might  say  that  “nth-order  decou¬ 
pling”  is  achieved  if,  in  the  expression  for  the  interaction- 
picture  evolution  operator  U(T),  all  terms  of  order  Tm 
are  pure-bath  terms  for  m  =  1, 2, . . . ,  n.  For  the  case  of 
a  qubit  subject  to  pure  dephasing  noise  (B\  =  B2  =  0), 
it  is  shown  in  [13|,  [39J  that  in  this  sense  nth-order  de¬ 
coupling  can  be  achieved  by  a  sequence  of  X  pulses  with 
n  +  1  pulse  intervals,  where  the  pulses  are  nonuniformly 
spaced  in  time.  For  general  qubit  noise,  nth-order  de¬ 
coupling  can  be  achieved  by  a  sequence  of  nonuniformly 
spaced  X  and  Z  pulses  with  altogether  (n  +  l)2  pulse 
intervals  [Hflj]. 

The  corrections  higher-order  in  T  are  not  necessarily 
small  unless  both  f3T  <C  1  and  JT  <C  1  are  satisfied. 
However,  the  ideal  pulse  sequence  constructed  in  [59|  has 


the  property 

[  dt  tmFa(t)  =  0,  m  =  0, 1, . .  .n  —  1,  a  =  x,y,z. 

Jo 

_  _  (197) 

(The  sequence  in  [131,  |39(  has  this  property  only  for 
a  =  z.)  Therefore,  even  if  (JT  is  not  small,  we  can  use 
the  correlation  function  analysis  to  show  that  same  se¬ 
quence  also  achieves  nth-order  decoupling  in  the  sense  of 
Eq.  (I177D.  Therefore  DD  works  effectively  if  tuT  -C  1, 
where  a;  is  a  typical  bath  frequency,  provided  that  either 
JT  C  1  or  (in  the  case  of  Gaussian  noise)  K  <C  1.  The 
same  remark  applies  to  pure  dephasing  noise  for  the  pulse 
sequence  in  pdj.  |39} . 

E.  Concatenated  dynamical  decoupling 

Instead  of  using  the  Magnus  expansion,  we  can  analyze 
the  performance  of  concatenated  DD  sequences  using  the 
Dyson  expansion  and  bath  correlation  functions.  As  in 
Sec.  IX  A1  we  will  suppose  that  the  higher-order  terms  in 
the  Dyson  expansion  can  be  neglected,  and  will  focus  on 
the  lowest-oixler  term  Eq.  (11761).  The  objective  is  to  show 
that,  by  concatenating  k  times  a  pulse  sequence  that 
achieves  first-order  decoupling,  fc-th  order  decoupling  can 
be  achieved,  in  the  sense  that  Sa{oj)  =  0{(<jjT)k]. 

To  illustrate  the  idea,  consider  the  simple  pulse  se¬ 
quence  that  decouples  pure-dephasing  noise  for  a  single 
qubit: 


Uc(tDD)  =  XIXI, 


(198) 
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so  that  the  “level-1”  function  multiplying  az  in  the  in¬ 
teraction  picture  can  be  represented  as 

FP  =  (+-).  (199) 

When  we  concatenate  the  pulse  sequence,  is  replaced 
by  Fz2\  in  which  F-1^  is  repeated  twice,  but  with  a  sign 
flip  in  the  second  repetition: 

^J2)  =  (+  -  -+),  (200) 

and  for  higher-level  sequences  we  have 

-fJ3)  =  (+--  +  -  +  +-), 

F +  -  +  +  +  +  -  + 

(201) 

etc.  Evaluating  the  Fourier  transforms  of  these  functions, 

r2kTo 

FW(w)=  /  dt  e~iutFj;k\t),  (202) 

Jo 

we  see  that 

fP(u)  =  (203) 

and 

M  =  (l-x2*"1)^-1^^),  (204) 

where  x  =  e~lUT°,  and  hence 

F<n\w)  =  (-iw)-: vx1/2  (x1'2  -  x-1/2) 

k= 1 

=  2co~1x1/2  sin(tuT0/2) 

n 

xl[x2  (2*  sin  (2fc_2wro))  .  (205) 


The  leading  behavior  of  this  function  for  small  uitq  is 


Fjn)M  =  ro(f)"2n(”-1)/2  (Wr0)n  +  •  •  •  ,  (206) 


and  therefore  Eq.  (11761)  becomes 


42  <  rS)  (u,,r0fA  +  ■  ■  ■  , 

(207) 

where  we  neglect  corrections  both  higher  order  in  the 
Dyson  expansion  and  higher  order  in  frequency.  Naively, 
this  expression  for  the  effective  noise  strength  ijdd  is  op¬ 
timized  by  choosing  the  level  of  concatenation  n  to  be 
the  largest  integer  such  that  2"_1  (uitq)  <  1  where  u> 
is  a  “typical”  bath  frequency.  Note,  however,  that  for 
2n  (wto)  ~  1  the  higher-order  corrections  in  (wro)  mod¬ 
ify  J)dd  by  an  0(1)  multiplicative  factor.  Note  also  that 
2"ro  =  is  the  duration  of  the  level-n  pulse  sequence, 
and  thus  the  optimal  pulse  sequence  has  duration  com¬ 
parable  to  a  typical  inverse  frequency  of  the  bath. 

Other  concatenated  pulse  sequences  can  be  studied 
similarly.  Consider  for  example  the  universal  DD  se¬ 
quence.  We  have  seen  in  Eq.  (11851)  that  this  sequence 
suppresses  noise  asymmetrically  (the  best  suppression  for 
az,  the  worst  for  ax),  so  we  might  choose  to  alter  the  se¬ 
quence  at  higher  levels  to  provide  more  balanced  noise 
suppression.  But  if  we  do  not  do  that,  the  functions 
Pik\uj)  can  be  specified  by  augmenting  Eq.  (118511  with 


F £fe)M 
Fjfc)H 


(x4“~‘)S,!  (4i) cos  (41-Vt„)/2)  sin  (4‘-1(ojt0)) 

(l  -x4‘-'  +  (x4*-)2  -  (x-*-)3)  -  (x‘‘-)3/2 

(l  -x4‘-  -  (x‘‘-)2+  (x4‘-‘)3)  =  (x**-‘)3/2 


(i)  sin  (2-4fc~1(a;ro)) 
cos  (4fc_1(wT0)/2) 
(-2)  sin2  (4fc~1(ct’To)) 
cos  (4fe_1(wro)/2) 


Fik~1]  M, 


F^Kco), 


(208) 


for  Fxk\oj ),  where 

F^  (W)  =  (4fe  (iu;T0)  +  ■■■)  F^  (W),  (209) 


The  weakest  suppression  of  low-frequency  noise  occurs 


29 


and  hence 


to  (4fe*wr0)  H - 

k=  1 


To(i)nAn(n+l)/2  +  .  .  .  } 


(210) 


where  we  neglect  the  terms  higher  order  in  u>tq.  From 
Eq.  (11761)  we  obtain  the  estimate  of  the  noise  strength 


1/2 

4"<"«>/J  |  V  )  + 

<i,a,/3 


(211) 


Noting  that  the  universal  DD  sequence  has  length  R  =  4, 
we  see  that  Eq.  (12111)  resembles  Eq.  (11381).  but  with  the 
operator  norm  e  replaced  by  a  bath  frequency. 


XI.  CONCLUSIONS 

We  have  derived  upper  bounds  on  the  effective  noise 
strength  7/dd  for  DD-protected  quantum  gates,  in  terms 
of  the  parameters  of  a  Hamiltonian  noise  model.  From 
the  upper  bounds  on  the  noise  strength  we  can  extract  a 
noise  suppression  threshold  condition,  a  sufficient  condi¬ 
tion  for  DD-protected  gates  to  outperform  unprotected 
gates.  We  can  also  derive  an  accuracy  threshold  con¬ 
dition;  when  the  noise  parameters  obey  this  condition, 
scalable  quantum  computing  is  possible.  Our  results 
show  that  DD,  and  in  particular  concatenated  DD,  can 
improve  the  gate  accuracy  and  overhead  cost  of  fault- 
tolerant  quantum  computing. 

We  have  used  two  different  methods  to  quantify  the 
accuracy  of  DD-protected  gates.  From  the  Magnus  ex¬ 
pansion  in  the  toggling  frame  we  can  derive  an  expression 
for  7/dd  in  terms  of  the  operator  norm  of  the  noise  Hamil¬ 
tonian;  an  advantage  of  this  method  is  that  ijdd  does  not 
depend  on  the  state  of  the  bath.  From  the  Dyson  expan¬ 
sion  in  the  interaction  picture  we  can  derive  an  expression 
for  ?7dd  in  terms  of  the  frequency  spectrum  of  bath  corre¬ 
lations.  While  the  bath  frequency  spectrum  does  depend 
on  the  state  of  the  bath,  the  second  method  sometimes 
yields  useful  result  when  the  first  method  fails,  because 
the  norm  of  the  noise  Hamiltonian  is  too  large. 

Our  analysis  of  fault-tolerant  circuits  built  from  DD- 
protected  gates  applies  only  to  Hamiltonian  noise  models 
satisfying  suitable  assumptions.  For  the  Magnus  expan¬ 
sion  analysis  we  used  the  local-bath  model;  this  allows  us 
to  study  each  DD-protected  gate  individually,  ignoring 
noise  correlations  among  distinct  gates  being  executed 
in  parallel  at  the  same  time.  For  the  correlation  func¬ 
tion  analysis  we  used  an  even  more  artificial  model,  in 
which  the  state  of  the  bath  is  refreshed  after  each  DD- 
protected  gate.  This  assumption  allows  us  to  include 
non-Markovian  effects  during  the  DD  pulse  sequence  at 
each  protected  gate,  but  to  ignore  these  effects  when  the 
DD-protected  gates  are  composed  in  a  quantum  circuit. 


It  is  clearly  desirable  to  extend  our  analysis  to  models 
with  more  general  noise  correlations. 

Here  we  have  proposed  to  combine  DD  with  fault- 
tolerant  quantum  computing  straightforwardly,  by  re¬ 
placing  each  gate  in  a  fault-tolerant  circuit  by  the  cor¬ 
responding  DD-protected  gate.  Perhaps  more  sophisti¬ 
cated  ways  to  combine  DD  with  fault  tolerance  can  be 
found,  leading  to  further  gains  in  efficiency  and  accuracy. 
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APPENDIX  A:  REVIEW  OF  THE  MAGNUS 
EXPANSION 


Here  we  will  briefly  review  some  properties  of  the  Mag¬ 
nus  expansion  that  are  used  in  our  arguments.  For  a  more 
detailed  discussion,  see  [44j|. 

The  foundation  of  the  Magnus  expansion  is  this  theo¬ 
rem: 

Theorem  1.  , Suppose 

=M(i)e°W.  (Al) 


Then 


ri  00  td 

|n(<)  =  5;^adx(, ,[«<*)]. 


(A2) 


n— 0 


Here  the  {Bn}  are  the  Bernoulli  numbers  defined  by 

~  B 


t  =  £ 


n—0 


n  rt 

-j 

n! 


(A3) 


and  ad))  [A]  is  defined  by 

ads  [A]  =  [B,[B,[-  ■  ■  [B,[B,  A]}  ■  ■  ■]]]  (A4) 

(ad °b[A)  =  A,  and  ad)) [A]  for  n  >  1  contains  n  nested 
commutators).  The  series  converges  provided  ||f2(t)||  < 


Proof.  To  obtain  a  useful  expression  for  M(t)  = 
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[lteQ^)  e  we  first  evaluate 


d 

dX 

d_ 

dt  V  d\ 
d_ 
dt 
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,-A  Q(i) 


An  \  -Afi(i) 


0AO(t) 


dt 


f l(t)e 


-AO  (i) 


=  eAO(t) 


(eAn(t)ft(£)) 

d  n(t) 


dt 


a-A  n(i) 


=  -AO(t) 


dt 


3AO(t)\ 


H(t)e 


-Afi(t) 


_  \ 


Q(t) 


n= 0 


(A5) 


In  the  last  line  we  have  used  the  identity 


In  the  Magnus  expansion,  we  express  0(t)  = 

Qn(t),  where  fl„(f)  is  nth-order  in  M.  Using  this 
expansion,  Eq.  (|A2I)  becomes 

=  M(t ), 

1  n—  1  yy 

o(t)  =  ^-fsij)(t),  n>  2,  (All) 

i=i 

where 

(ra-l) 

adnii(t)adni2(t)  •  --adn..^)  \M(t)) ; 

(A12) 


00  \  n 

eXBAe-XB=J2^adnBlA},  (A6) 

n= 0 

which  can  be  verified  by  differentiating  both  sides  k  times 
with  respect  to  A  and  then  setting  A  =  0.  Expressing 
M(t)  as  the  integral  of  its  derivative,  we  find 


here  the  sum  is  over  nonnegative  integers  {«i>  *2,  ■  •  • ,  ij } 
satisfying  U  +  *2  H - +  ij  =  n  —  1.  We  see  that 

SP(t)  =  [Qn-1(t),M(t)],  (A13) 

and  that  S„ ^  for  j  >  1  can  be  expressed  as 


M(i) 


fdxA 

fL  p-An(t) 

Jo  dX 

dt  V  ) 

c?A  55  “adi 


■n(t) 
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n— 0 


n— 0 
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dt 


n(t) 


(^Ti)!ad^(t) 


dt 


n{t) 


(A7) 


n—j 
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2  <  j  <  n 


1.  (A14) 


The  relations  Eq.  (1A11I).  (1A13I).  (1A14I)  provide  an  algo¬ 
rithm  for  generating  the  terms  in  the  Magnus  expansion 
recursively,  and  we  use  these  recursion  relations  to  derive 
our  upper  bounds  on  the  higher-order  terms. 


Thus  we  have  shown  that  M(t)  =  Oadam 
where 


APPENDIX  B:  EVEN  MAGNUS  TERMS  VANISH 
FOR  A  TIME-SYMMETRIC  HAMILTONIAN 


71  —  0 


1)!' 


eA  —  1 
A 


which  is  inverted  by 


Here  we  prove  the  fact  that,  if  Hm(£)  is  time- 
symmetric,  all  even  Magnus  terms  vanish.  This  was  pre¬ 
viously  known  in  the  NMR  literature,  at  least  for  the  case 
of  a  piecewise  constant  Hamiltonian  [61  j . 


Therefore, 

!«(*>  =  Oi'ow  [«(*)] ,  (A10) 


Lemma  2.  If  Hm(T  —  t)  =  HM(t),  then  fln(T)  =  0  for 
all  even  n. 

Proof.  First  we  show  that  H(T)  is  an  odd  function  in 
A(t)  =  —iHM(t)  when  Hm{I)  (or  correspondingly  A(t)) 
is  time-symmetric  about  T/2.  Defining  Ajv  =  T/2N  for 
N  a  positive  integer,  the  evolution  operator  from  t  =  0 
to  t  =  T  can  be  written  as 


from  which  Eq.  (1A2I)  follows. 

Regarding  the  convergence  of  the  expansion,  we  note 
that  ||ada||  <  (2||B||)n,  and  that  the  series  expansion  of 
x/(ex  —  1)  converges  for  \x\  <  2-jt,  because  the  nearest 
poles  to  the  origin  in  the  complex  cc-plane  are  at  x  = 
±2n i.  Therefore  the  expansion  in  Eq.  (IA2I)  converges  for 
||2H(t)||  <  2n. 

□ 


U(T,  0)  =  lim  eA(T)AN  eA(T-AN)AN  _  _  .  eA(^+Ajv)A;v 
N — >-oo 

X  £>A(^—An)An  gA(A;v)AiVgA(0)Aiv 

=  lim  eA(0)AN  eA(AN)AN  _  _  _  eA(If-AN)AN 
N — xx) 

X  £A(^—An)An  .  .  .  gA(AA/-)AiVg^4(0)A7v 

’  (Bl) 
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where  in  the  second  equality,  we  have  used  the  time- 
symmetry  A(T  —  t)  =  Aft).  Taking  the  adjoint  of 
Eq.  (IB  11) .  and  noting  that  A(t)^  =  — Aft ),  we  find 

UUT  0)  =  lim  e~A(0)AN e~MAN)AN  . . .  e-A(^-AN)AAr 
V  ’  '  JV— >  oo 

^  g— —  Ajv)Ajv  .  .  .  g  —  A(Ajv)AjVg  —  A(0)An 

(B2) 


□ 


In  this  paper,  we  use  Lemma  [3]  in  three  ways.  In  one 
application,  we  consider  the  case  where  both  Hamiltoni¬ 
ans  are  time  independent,  and  conclude  that  (compare 
with  Eq.  (1421)1 


Thus  U*(T,  0)  has  the  same  form  as  U(T,  0),  except  for 
the  replacement  Aft)  — >  —Aft). 

Since  U(T,  0)  =  exp($2(T))  and  W(T,  0)  = 

exp  (— f 2(T)),  we  conclude  that  under  the  replacement 
Aft)  —>  — Aft ),  f2(T)  transforms  as  f2(T)  — >  — fl(T)  + 
i27r£,  for  some  integer  t.  In  fact,  since  the  integer  £  cannot 
jump  discontinuously  when  A(t)  is  smoothly  deformed,  £ 
must  be  a  constant  independent  of  Aft),  and  by  taking 
the  limit  A(t)  — >  0  we  see  that  i  =  0;  thus  f2(T)  changes 
sign  under  A{t)  — >  — A(f),  i.e.,  is  an  odd  function  of  A(f). 

In  general,  f ln[T)  is  an  integral  of  an  expression  con¬ 
taining  n  factors  of  Aft).  Thus,  f ln(T)  is  invariant  under 
the  replacement  Aft)  — >  — Aft )  for  n  even,  and  changes 
sign  under  this  replacement  for  n  odd.  Since  in  the  time- 
symmetric  case  12 (T)  changes  sign  under  Aft)  — >  —Aft), 
we  conclude  that  f2n(T)  vanishes  for  n  even.  □ 

APPENDIX  C:  ERROR  ESTIMATE  FOR  TIME 
EVOLUTION 


Here  we  prove: 

Lemma  3.  Suppose  that  the  time  evolution  operator  U(t) 
satisfies  the  differential  equation 

jU{t)  =  —iH(t)U(t)  (Cl) 

with  the  initial  condition  U(tf)  =  Uq,  while  U(t )  satisfies 

jU(t)  =  - iH(t)U(t )  (C2) 


with  the  same  initial  condition,  where  both  H  (t)  and  H (t) 
are  Hermitian.  Then 


\\U(t)^U(t)\\<  /  ds\\H(s)-H(s)\\. 


'to 


Proof. 


< 


< 


wm-um  =  \\um(t)-l-i\\ 

-i  J  ds  U(s)  (h(b)  -  H(s)^j  U(s)-1 
ds  U(s)  U(s)-1 

ds  H{s)  —  H(s) 


'to 


(C3) 


(C4) 


\\U(t)-U(t)\\<(t-t0)\\H-H\\.  (C5) 

This  inequality  allows  us  to  relate  the  effective  noise 
strength  ryo d  achieved  by  dynamical  decoupling  to  our 
bounds  on  the  terms  in  the  Magnus  expansion. 

In  another  application,  we  consider  H  to  be  H$  +  Hb, 
where  Hg  governs  the  ideal  system  dynamics  and  Hb 
governs  the  bath  dynamics,  while  the  Hamiltonian  for  the 
noisy  joint  evolution  of  system  and  bath  is  H  =  H+Hsb  , 
where  Hsb  is  responsible  for  the  noise.  Then  in  the  local- 
bath  model,  if  || -fossil  <  J  and  a  gate  is  executed  in 
time  ro,  Lemma  [3]  implies  that  the  norm  of  the  “bad” 
part  of  the  gate  is  bounded  above  by  Jtq.  Thus  we  may 
estimate  the  effective  noise  strength  in  the  absence  of  DD 
as  77  =  Jtq ,  as  in  Eq.  (l53l). 

In  the  third  application,  we  use  Lemma  [3]  to  estimate 
the  error  arising  from  pulses  with  nonzero  width.  We 
consider  H(t)  to  be  the  Hamiltonian  describing  the  ac¬ 
tual  DD  sequence  with  realistic  pulses,  and  H(t )  to  be 
the  idealized  evolution  for  zero-width  pulses,  where  both 
Hamiltonians  are  expressed  in  the  toggling  frame  deter¬ 
mined  by  the  ideal  sequence.  Suppose  that  there  are 
R  pulses,  and  that  each  realistic  pulse  has  support  in  a 
time  interval  of  width  5.  Both  H(t)  and  H(t)  can  be 
expressed  as  a  sum  of  a  bath  Hamiltonian  and  an  error 
Hamiltonian;  the  bath  Hamiltonian  cancels  in  the  dif¬ 
ference  H(t)  —  H{t),  and  we  suppose  that  for  both  the 
realistic  and  ideal  sequences  the  norm  of  the  error  Hamil¬ 
tonian  is  bounded  above  by  J  during  the  pulses.  Thus 
\\H(t)  —  H(t)\\  <  2  J  during  the  pulses  (a  total  duration  of 
RS),  while  H(t)  =  H(t )  outside  the  pulses;  thus  Lemma 
[3] implies  || U{T)  —  U(T) ||  <  2 R8J,  as  in  Eq.  (I149D. 


APPENDIX  D:  BOUNDS  FOR  EVEN  MAGNUS 
TERMS  IN  THE  TIME-SYMMETRIC  CASE 


We  want  to  generalize  the  argument  used  to  compute 
the  bound  for  D2  (T)  in  the  case  where  Hm(2)  is  time- 
symmetric  except  for  t  €  A.  To  do  this  for  higher-order 
terms  requires  a  formula  for  the  Magnus  terms  for  which 
all  the  multiple  time-integrals  are  explicit.  Such  a  for¬ 
mula  can  be  found  in  [62|  (for  n  >  2): 


f2n(T)  =  —  [  dt-\  . . .  [  dtnLn 
n  Jo  Jo 

x  [[. . .  [A{t\),  A(tf)\, . .  .],A(tn)]  (Dl) 
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where 

n—  1  1  n—l 

Ln  =  Yl  y(-1)*+1  E  n^H). 

1=1  l<ji<...<jn-i<nm=l 

(D2) 

The  coefficients  take  care  of  the  time-ordering  and  re¬ 
labeling  of  the  integration  variables.  For  n  even,  follow¬ 
ing  what  we  did  in  the  fl2(T)  case,  we  split  up  the  n  time- 
integrals  into  n  different  cases:  (1)  none  of  ti,  i  =  1, . . .  n 
are  in  A,  (2)  exactly  one  of  U  £  A,  (3)  exactly  two  of 
ti  £  A,  . . .,  (n)  exactly  n  of  t,  £  A.  Case  (1)  is  zero  from 
the  time  symmetry  of  HmO)  for  t  ^  A;  the  remaining 
cases  we  bound  by  first  bounding  the  nested  commutator 
and  Ln,  and  then  doing  the  time-integral. 

The  ( n  —  l)-nested  commutator  can  be  bounded  as 


\\[[...[A(t1),A(t2)],...},A(tn)]\\ 
<2n-2\\[A(tl),A(t2)}\\\\A(t3)\\...  \\A(tn)\\ 

<  2n~2  (4Je)en~2 

=  2n  Je11-1.  (D3) 


Putting  these  back  in  f 1„(T)  (n  even)  and  doing  the 
time-integrals,  we  find  that 


Uncoil 

<  4  (2n_1  -  1)  (2 nJen~1) 


Qa(T-A)"-1 


+  (  2  )  A2(T  —  A)"-2  +  . . .  + 


A" 


2n+1Jen~1 


(2"_1  -  1)  [Tn  -  (T  -  A)71]  .  (D7) 


In  the  first  inequality  above,  the  terms  in  the  brackets 
are  the  n—l  cases  for  choosing  the  times  t\, ... ,tn,  with 
at  least  one  being  in  A. 

One  can  check  that  Eq.  (ID7I)  agrees  with  Eq.  (110411  in 
the  n  =  2  case  and  with  Eq.  (I106D  in  the  n  =  4  case.  We 
also  see  that,  for  each  n,  fln(T)  is  of  order  ATn_1,  and 
thus  vanishes  in  the  limit  A  — >  0. 


APPENDIX  E:  S(nj)  COEFFICIENTS 


The  2ra-2  factor  in  the  first  line  comes  from  opening 
up  (n  —  2)-nested  commutators  using  submultiplicativ- 
ity  of  the  operator  norm.  The  (4Je)  factor  in  the  second 
line  comes  from  the  bound  for  ||  [A(fi),  A(f2)]  |  given  in 
Eq.  (|88l) .  The  coefficient  Ln  can  be  bounded  by  ignoring 
the  step  function  (i.e.  ignoring  the  time-ordering,  since 
we  do  not  have  the  details  of  A  anyway): 


Here  we  derive  bounds  on  the  Sn  ^  coefficients  found  in 
the  recursive  formulas  (Eqs.  (!82aP  (I82el)')  for  the  Mag¬ 

nus  terms. 

Lemma  4.  For  all  n  >  2,  1  <  j  <  n  —  1, 

\\S^(t)\\<f^J(2et)n-1,  (El) 


n—l 
\Ln\  <  E 


1 

7 


i=i 


E  i 

l<3i<32<---<3n~l<n 


n—l 


E 


i 

7 


(D4) 


The  binomial  factor  arises  from  counting  the  number  of 
terms  in  the  sum  over  jf.  we  pick  n  —  l  elements  from 
the  numbers  1  to  n  —  1,  and  arranging  them  in  ascending 
order  gives  a  single  choice  of  (ji,  j2, . . . ,  jn-i)  and  hence  a 
single  term  in  the  sum.  The  number  of  ways  of  choosing 
n  —  l  elements  from  n  —  l  distinct  numbers  is  given  by  the 
binomial  factor.  To  bound  the  remaining  sum,  consider 


n—l 


dx{  l+x)™-1  =  E 
1=0 
n—l 

=  E 

1=0 


n  —  1\  1 


.  -  l 


X—l 


T=0 


n  —  1\  1 


.  -  l 


(D5) 


Therefore,  we  have  that 

r*l 


iL"i  -i  <t(i+i)"~I“(V)« 


=  -  (2n_1  -  1)  . 


(D6) 


where  the  coefficients  are  defined  recursively: 

A(0)  =  1,  /!0)=  0,  n>  1,  (E2a) 

n-j  m  —  1  |  R  | 

fij)  =  2  E  E  n  >  2;  (E2b) 

here  the  {Bp}  are  the  Bernoulli  numbers,  defined  by 


x 

e*  -  1 


B , 


E.  rvP 


p= 0 


p< 


(E3) 


Proof.  We  will  prove  the  lemma  by  induction.  We  begin 
with  the  smallest  case  where  n  =  2,j  =  1: 

I|41)WII  =  II^1W,-^mW]|| 

<  [  ds  ||  HM(t)\  || .  (E4) 

Jo 

The  commutator  can  be  bounded  as  in  Eq.  ([88 1): 
\\[HM(s),HM(t)]\\  <  4 Je.  This  thus  gives  ||S'21)(t)||  < 
4 Jet.  Since  / ^  =  2,  this  can  be  rewritten  as  ||S'21^||  < 
4  Jet  =  f^\j{2et). 

For  a  given  n  >  3,  suppose  that  the  lemma  holds  for  all 
Sm ^  for  m  <  n,  l<p<m  —  1.  There  are  three  different 


J  <  e,  we  get 
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types  of  Sn'*: 

SP(t)  =  (t),-iHM(t)} 

Sin-1}(t)  =  2)(t) 


(E5a) 

(E5b) 


TL-J 

S%\t)  =  +  E 


m= 2 


for  2  <  j  <  n  —  2. 


(E5c) 


Note  that  the  last  case  occurs  only  for  n  >  4.  We  will 
bound  each  case  separately.  First,  for  Sn\ 

lls^’WII  <  2||nn_1(t)||||H«(t)|| 

s&E1#1  /‘ *11*11 

P=1  p ■  Jo 

(E6) 

Eq.  dUD  becomes  /E  =  2  p^-i)  fn-i  when  j  =  1; 

therefore  ||5i1^(t)||  <  fZj(2et)n~1. 

Next  we  bound 

ii^"-1)(t)[i<2||n1(t)ini^r12)(i)ii 

<  ft~!2)  J(2et)”-1  (E7) 


Eq.  (IE2I)  becomes  fZ  ^  =  2 fZZ  when  j  =  n  —  1; 

therefore  ||S,i"~Ei)ll  <  J(2et)ra_1. 

Lastly,  the  2  <  j  <  n  —  2  cases: 

Uncoil 

<2||n1(t)||||^:-11)(i)ll 

n-j  m-1  I  R  I  ,  rt  \ 

+  d-II^WIl)  iiEEE)ii 

m— 2  p=l  '^v'°  ' 

<  ft11)J(2etr~1 


The  expression  within  the  brackets  in  the  last  line  looks 
like  fn'1  in  Eq.  (|E2j).  except  we  need  to  add  in  the  m  =  1 
terms,  as  well  as  the  p  =  0  terms.  In  fact, 


n—j  m—  1 

E 

m=2  p—  1 

.  fU) _ ?\J!A f (°) f?-1  _ o v'  f(o) /-(j-1) 

•fn  z  0!l  •'i  *'n— 1  z  /  v  0!ra Jrn  ^n~rn 

m= 2 

=  /«-2/Ei1),  (E9) 

where  in  the  last  line,  we  have  used  the  fact  that  /Ei  = 
0.  Putting  this  into  ||sE(f)||,  and  using  the  fact  that 


l-^pl  Ap) 
n!m  m  Jn~m 


llEE)ll  <  ft11)J(2etr~1 

+  J(et)(2et)n~2  [/«  -  2 /, 
<  fP  J{2et)n~\ 

This  completes  the  induction. 


O-i) 

n— 1 


(E10) 

□ 


APPENDIX  F:  fn  COEFFICIENTS 


In  [1115  the  {/„}  were  shown  to  be  coefficients  in  the 
power  series  expansion  of  of  G~1(y)  =  fnDn ,  the 

inverse  function  of 


r  7 

X 

/  xy 

/  dx 

2+  - 

1  —  cot  —  ) 

Jo 

2 

V  2/ J 

(FI) 


Here,  we  will  provide  a  self-contained  proof  of  the  above 
claim.  It  suffices  to  show  that  the  coefficients  of  G_1  can 
be  written  in  the  form  Eq.  (l96)l.  with  fZ  defined  via  the 
recursion  relations  (|E2I). 

First,  we  prove  a  lemma  that  applies  to  a  general  func¬ 
tion  y(s): 

Lemma  5.  Suppose  the  smooth  function  y  =  G(s)  is 
monotonic  on  its  domain  and  satisfies  y( 0)  =  0.  Then 
G~1(y)  can  be  written  as  YZ?=i  fnVn  where 


fn  = 

n\ 


**>s 


g(s) 


(F2) 


s= 0 


andt  =  W)- 


Proof.  Since  y  is  monotonic  on  its  domain,  the  inverse 
function  G~1(y)(=  s)  exists  and  has  derivatives 


(F3) 


where  in  the  first  equality,  we  have  used  the  chain  rule 
of  differentiation:  ^  =  %  fs  =  (% )  fs  = 

Since  y  is  a  smooth  function  on  its  domain,  so  is  g(s) 
and  hence  all  derivatives  of  G-1(y)  exist.  We  can  then 
expand  G~1(y)  as  a  Taylor  series  about  y  =  0  and  write 
G ~1(y )  =  EZo  fnyn  I°r  some  coefficients  /„.  We  see 
that  fo  =  0  since  G_1(0)  =  0.  For  n  >  1,  the  Taylor 
coefficients  are  given  by 


fn 


1  dn 


y= o 


(F4) 


which,  upon  inserting  Eq.  (IF3I)  and  noting  that  y( 0)  =  0, 
immediately  gives  Eq.  (IF21).  □ 

For  our  purposes,  the  function  y(s)  is  given  in  Eq.  (IF1I). 
i.e.  y(s)  =  G(s)  which  is  smooth  and  monotonic  over 
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the  domain  s  £  [ — 27r,  27t] .  It  is  also  clear  that  y( 0)  = 
0.  Lemma  [5]  thus  tells  us  that  we  can  write  G~1{y)  = 
fnyn,  where  /„  is  given  in  Eq.  (jF2|  with 


9(s) 


=  2 + 1  (' _  cot  I)  ■ 


(F5) 


Lemma  6.  The  coefficients  fn  in  G  1(y)  =  Y^^Li  fnVn 
can  be  written  in  the  form  Eq.  (1961).  with  fn'1  defined 
according  to  Eq.  (1E21). 


Proof.  For  n  =  1,  the  index  j  in  Eq.  (1961)  can  only  take 
value  0,  so  /i  can  be  written  in  the  form  Eq.  (1M1)  if  we  set 
=  1.  To  handle  the  case  n  >  2,  we  use  Eq.  (IE3D  to 
expand  cot(s/2)  in  terms  of  Bernoulli  numbers,  finding 


s 

-cot 


=  B0+(^B1  +  ^Sj(is)+Y^^-(isy.  (F6) 


Noting  that  B0  =  1,  B\  =  —1/2,  B2j+i  =  0  for  j  >  1, 
Bij  <  0  for  j  >  1  and  B^+2  >  0  for  j  >  0,  Eq.  (IF6I) 
becomes 


3(s)  =  2  +  ^  (i-cot^)  =f2~^s3-  (F7) 

j=o  J ’ 

Using  this  series  expansion  of  <?(s),  we  can  rewrite  (IF2D 
for  fn> 2  as: 


fn 


1  ^  \Bj\  2n_1 

n2n— 1  ^  j!  (n  —  1)! 


(F8) 

We  omit  the  j  =  0  term  in  Eq.  (IF8I)  because  the  deriva¬ 
tive  of  a  constant  vanishes,  and  the  sum  over  j  termi¬ 
nates  at  j  =  n  —  1  because  higher-order  terms  vanish 
when  we  set  s  =  0.  Thus  /^>2  =  0,  and  by  comparing 


with  Eq.  (l96l)  we  define  fn'1  as 


fti)  — 

J  n 


2n-l 

(n-  1)! 


(F9) 


Now  we  need  to  show  that  the  {fn'*}  obey  the  recursive 
relation  (|E2j).  Using  our  definition  of  fn1  from  (IF9D.  the 


right-hand  side  of  Eq.  (IE2I)  can  be  rewritten  as 

n-j  m—l  ,  „ 

■Dji 


2  V  V  J£e1  f(p) 

^  ^  p\mJm  Jn~ 


m= 1  p—0 
n—j  m—l 


-i) 

— m 


2  I-^pI  2?ra  1 

^  p\m  (m  —  1)! 

m=l  p=0  ^  '  7 


m—l 


in- m—l 


(n  —  m  —  1)! 


ds 


ds, 

n— m— 1 


s=0 


s=0 


t-1 


=  _ _ V 

(n  -  1)!  ^ 

v  7  m—l 


n  —  1 
m 

n—m—l 


ds 


■»-!  m“1  I  R  I 

^  l£plsp 


p=0 


p! 


el”1 


s=0 

(F10) 


5=0 


The  expression  ^TsP)  Just  d(s)  ^  we  can  ex~ 

tend  the  upper  limit  of  the  sum  to  infinity.  We  can  indeed 
do  this,  because  in  the  equation  above,  the  expression 
is  differentiated  m—l  times  and  s  is  set  to  0.  Hence, 
higher-order  terms  in  the  power  series  expansion  of  g(s ) 
with  p  >  m  do  not  contribute.  Therefore, 

n-j  m—l  ,  „  , 

o  v  V  Ifz!  f (p)  ft#-1) 

Z^  Z^  nlmJm  Jn-m 

m=  1  p—0  ^ 


(n-  !) 


-1  W-J 

1)!  ^ 


n  —  1 


ds 


9  ds 


n—m—l 


2"-1  /n  -  1 

(n  -  1)!  ^ 

v  7  m—l 


9Ts 

n—l—m 


1 

m  -| 
S 

J-1 


s— 0 


s— 0 


s=0 


(FH) 


5=0 


Now,  for  any  differential  operator  P  satisfying  the  prod¬ 
uct  rule,  i.e.  V(xy )  =  V{x)y  +  xV(y)  (where  x  and  y 
commute),  T>n  has  the  binomial  expansion 

Vn(xy)  =  (")  PH*)]  [Vn~m(y)]  •  (F12) 

m— 0  '  ' 


Take  T>  =  x  =  s  and  y  =  sJ  1.  Then  (note  that  the 
m  =  0  term  is  zero), 


ds 


n—  1 


n— 1 

=  E 

3—0 


n  —  1 
m 


ds 


ds 


J-1 


s= o 


8= o 


(F13) 

Putting  this  into  (IF  1  ID  gives  exactly  the  expression  for 

#  mm-  □ 
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APPENDIX  G:  PURE-BATH  TERM  IN  SECOND 
ORDER  OF  THE  MAGNUS  EXPANSION 


APPENDIX  H:  NOISE  PARAMETERS  FOR 
CONCATENATED  DYNAMICAL  DECOUPLING 


Here  we  consider  the  case  where  the  toggling-frame 
Hamiltonian  has  a  decomposition  H(t)  =  Hb  +  Herl(t) 
such  that 

Hb  =  B0®I,  =  (Gl) 

O' 

where  Sa(t)  =  U%(t)SaUc{t)  and  the  operators  {Sq,}  are 
a  Hermitian  basis  for  traceless  operators  acting  on  the 
system  such  that 

tr  (SaSp)  =  0  (G2) 

for  all  a  ^  /3.  We  further  assume  that  each  pulse  either 
commutes  or  anticommutes  with  each  Sa,  so  that 

sa(t)  =  U'c(t)SaUc(t)  =  ±Sa,  (G3) 

and  hence 

HeiI(t)  =  Y/<;a(t)Ba®Sa  (G4) 

a 

where  (a(t)  =  ±1.  These  assumptions  are  true,  in  par¬ 
ticular,  for  an  n-qubit  system  if  each  Sa  and  each  pulse 
is  a  traceless  n-qubit  Pauli  operator.  We  will  show  that 
under  these  assumptions  the  second-order  term  in  the 
Magnus  expansion 

MT)  =  ~  f  dh  f  1  dt2  [Hfa^Hfa)]  (G5) 
*  Jo  Jo 

contains  no  pure  bath  term;  that  is,  trg(H2(T))  =  0. 

Because  [Bb,Bb]  =  0,  it  suffices  to  show  that 
the  system  trace  vanishes  for  [Hb,  Herl(t±)]  and 
[HeTI(ti),  HeTT(t2)]  for  any  ti,t2  £  [0,  T].  First  we  ob¬ 
serve  that 

[HB,HeiI(t)]  =  ^2Ca{t)[B0®I,Ba®Sa\ 

ot 

=  52Ca(t)[Bo,Ba]®Sa  (G6) 

ot 

has  vanishing  system  trace.  Next  we  note  that  if  the 
product  SaS/3  is  traceless  it  can  be  expanded  in  the  basis 
{S'o},  so  that 


For  the  analysis  of  concatenated  DD  in  Sec.  IVIII1  we 
considered  dividing  the  third-order  term  in  the  Magnus 
expansion  H3  into  a  pure  bath  term  and  a  remainder.  For 
that  purpose  we  may  use: 

Lemma  7.  Suppose  an  operator  O  has  a  decomposition 

O  =  l®B0  +  ^2sa®Ba,  (HI) 

O' 

where  both  terms  are  Hermitian  and  tr  ( Sa )  =  0  for  each 
a.  Then 

Poll  <  ||0||,  (H2) 

and 

<2|p||.  (H3) 

ot 

Proof.  To  derive  Eq.  (1H2I).  suppose  that  | ip)  is  a  normal¬ 
ized  pure  state  such  that  |(^>|.Bo|V’)|  =  Poll,  and  consider 
the  expectation  value 

(x|  ®  (ip\  Sa  0  B°^j  M 0  1^) 

J2sa(-iP\BaW))\x),  (H4) 

Q!  / 

where  |x)  is  also  a  normalized  pure  state.  The  right-hand 
side  of  Eq.  (El  is  the  expectation  value  in  the  state  |x) 
of  a  traceless  Hermitian  operator.  Unless  this  operator 
is  zero,  the  expectation  value  can  be  either  positive  or 
negative  depending  on  how  |x)  is  chosen.  By  choosing 
|x)  so  that  the  expectation  value  (J2a  Sa  ®  Ba)  in  the 
state  |x)  ®  \if)  is  either  zero  or  has  the  same  sign  as 
(I  ®  -Bo),  we  have 

|(0)|  >  |(I®  Bo) I,  (H5) 

and  Eq.  (IH2D  follows.  From  the  triangle  inequality, 


SaS0  =  5a0Ta  +  Y,  9«f 37^7  (G7) 

7 

(where  Ta  might  have  a  nonvanishing  trace).  Therefore 
HeIT(ti)HeTT(t2 )  =  "Y,  Cct(tl)Ca(t2)BaBa  ®  Ta  +  •  •  • 

Ot 

Prr(P#err(U)  =  Y  Ca  (*2  )  Ca  ftl  )■ Ba Ba  ®  Ta  H - 

Ot 

(G8) 

where  the  ellipsis  represents  terms  with  vanishing  system 
trace.  Thus  in  the  commutator  [HelI(ti),  Herr(t2)]  the 
terms  proportional  to  Ta  cancel,  and  what  remains  has 
vanishing  system  trace,  as  we  wished  to  show. 


||XP«®B“H  =  ||O-I®B0||  <  ||O||  +  ||I®B0||  <  2||0||, 

(H6) 

which  proves  Eq.  (1H3I).  □ 

The  inequality  Eq.  (IH31)  is  tight  if  we  do  not  restrict 
the  dimension  of  the  system,  but  if  the  system  is  a  qubit 
(two  dimensional),  it  can  be  improved  to 

|l5>Q®Fq|  <  ||0||.  (H7) 

Ot 

For  a  qubit,  there  is  an  anti-unitary  time-reversal  opera¬ 
tor  T  :  \ip)  — *  tJy\if)*  such  that  T^aaT  =  —aa.  Suppose 
\if)  is  a  normalized  pure  state  such  that  \{'f’\'f2acra  ® 
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Ba\if)\  =  ||  La  Va®Ba\\.  By  applying  T <g> I  if  necessary, 
we  can  choose  | ip)  so  that  (%2a  cra  <g>  Ba)  and  (I  ®  B0) 
have  the  same  sign  (unless  (I  ®  Bq)  =  0).  Therefore 


If  the  norm  of  the  sum  A  +  B  is  not  too  large,  we  can 
use  Lemma  [8]  to  show  that  A  is  close  to  B  when  eA  is 
close  to  eB .  For  example,  suppose  that 


l(0)|  > 


a  a  ®  Ba 

OL 


(H8) 


||7l  + -B||  <  e+,  W-A  —  B ||  <  e_. 


Then  Lemma  [8]  implies 


and  Eq.  (IH7j)  follows. 


||A-B||<c(e+,e_)||eA-eB 


(18) 

(19) 


APPENDIX  I:  RELATING  DISTANCE  BETWEEN 
OPERATORS  TO  DISTANCE  BETWEEN  THEIR 
EXPONENTIALS 


where 


c(e+,e_) 


/2sinhe_/2\ 

e_/2  J 


-1 


(110) 


Here  we  prove: 

Lemma  8. 

||eA  —  eB||  >  2\\A~  B\\ 

^2 exp  Q||A  +  H||^  sinh  Q||A-  .  (II) 

Proof.  Expanding  the  exponentials,  we  obtain 

OO 

eA  _  eB  =  A  -  B  +  ^  ±  [An  -  Bn] ,  (12) 

n—2  U' 

and  therefore 

OO  1 

\\eA-eB\\>\\A-B\\-Yl-Mn-Bn\\.  (13) 

'  n\ 

n= 2 

Defining 

N  =  \{A  +  B),  M=\{A-B),  (14) 

we  have 

An  —  Bn  =  (N  +  M)n  —  (N  —  M)n,  (15) 

and  when  we  apply  the  binomial  expansion  to  ( N+M)n  — 
(N  —  M)n  the  terms  even  order  in  M  cancel.  There  are 
^(m)  terms  °f  order  m  in  M  for  m  odd,  each  with  an  op¬ 
erator  norm  bounded  above  by  ||M||m||7V||n_m;  therefore 

\\An^Bn\\  <  2  J2  (n)\\M\\rn\\N\\n-m 

odd  m  ^  ' 

=  (||JV||  +  ||M||)"-(||JV||-||M||)n. 

(16) 

Thus  we  find 

00  i 

^_|!A"-^||<exp(||A||  +  ||M||) 

n= 2 

—  exp  (||iV||  —  ||  M  ||)  —  2||M|| 

=  exp  (|| A/’ll)  •  2  sinh  (||M||)  —  2||M||,  (17) 

and  substituting  into  Eq.  m  yields  Eq.  (HU).  □ 


For  example,  if  e+  =  e_  =  0.3,  we  find  c(e+,  e_)  =  1.20. 

APPENDIX  J:  BATH-STATE-DEPENDENT 
NOISE  STRENGTH  AND  DYSON  EXPANSION 

In  the  local-bath  model,  the  noisy  operation  applied 
at  the  circuit  location  a  is  a  unitary  transformation  Ga 
acting  jointly  on  the  system  and  bath.  We  may  express 
Ga  as  the  sum  of  a  “good”  part  Qa  =  Ga  ®  Ba  (where 
Ga  is  the  ideal  gate),  and  a  “bad”  part  Ba  =  Qa  —  Ga- 
The  accuracy  threshold  theorem  proved  in  [3l|,  [33j  es¬ 
tablishes  that  quantum  computing  is  scalable  provided 
the  noise  strength  fj  is  smaller  than  a  critical  value  rjo- 
For  this  purpose,  the  noise  strength  may  be  defined  in 
the  following  way.  Recall  that  we  model  a  noisy  prepara¬ 
tion  of  a  qubit  as  an  ideal  preparation  followed  by  noisy 
Hamiltonian  evolution  for  a  prescribed  period.  There¬ 
fore,  we  may  assume  that  the  initial  state  of  the  system 
at  the  very  beginning  of  a  quantum  computation  is  ideal, 
and  that  the  initial  state  of  the  system  and  bath  is  a 
product  state 

(J1) 

It  is  convenient  to  introduce  a  reference  system  R  that 
purifies  the  initial  state  of  the  bath;  then  the  initial  state 
of  system,  bath,  and  reference  system  is  a  pure  state 

I ^SBr)  —  I'f's')  ®  (J2) 

where 

P°b  =  (J3) 

Now  consider  a  quantum  circuit  acting  on  the  initial  state 
an(f  fef  denote  a  set  of  r  locations  in  the  cir¬ 
cuit.  Let  [7bad  (Xr )  denote  the  transformation  that  results 
if  we  place  the  noisy  gate  Ga  at  each  location  a  filr  and 
place  the  bad  part  Ba  at  each  location  a  €  I,.;  UhaA(Ir) 
acts  trivially  on  R.  We  may  say  that  the  noise  strength 
is  fj  if 

||[/bad(Tr)|$°)||  <  fjr 
for  any  set  Zr  of  r  locations  [U] . 


(J4) 
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Since  each  Ga  is  unitary  and  therefore  has  operator 
norm  1,  the  submultiplicative  property  of  the  norm  im¬ 
plies 


||[/bad(Ir,)|<f>°)||  <  ||C/bad(Jr)||  <  1]  ||6q||.  (J5) 

ae  Ir 

Therefore,  we  may  choose  the  noise  strength  to  be 

fj  =  max||6a||,  (J6) 

a 

which  does  not  depend  at  all  on  the  initial  state  of  the 
bath  [3lj|.  We  used  this  definition  for  the  analysis  in 
Sec.  HVlIVIIIl  based  on  the  Magnus  expansion,  of  the  ef¬ 
fective  noise  strength  achieved  by  dynamical  decoupling. 

However,  for  the  analysis  of  the  effective  noise  strength 
in  Sec.  IS  based  on  bath  correlation  functions  and  the 
Dyson  expansion,  we  use  a  different  definition  of  fj  that 
does  depend  on  the  initial  state  of  the  bath.  To  state  the 
new  definition  simply,  it  is  convenient  to  put  a  further 
limitation  on  the  noise  model  that  was  not  needed  in  the 
Magnus  expansion  analysis  —  we  assume  that  the  state  of 
the  bath  is  discarded  at  the  end  of  each  circuit  location, 
and  replaced  by  a  fresh  bath  state  at  the  beginning  of 
the  next  location. 

We  admit  that  this  new  more  restricted  noise  model 
is  even  more  artificial  than  the  local-bath  model  we 
analyzed  previously  using  the  Magnus  expansion.  We 
adopted  the  local-bath  model  so  that  we  would  not  need 
to  consider  spatial  correlations  in  the  noise  at  any  given 
time,  and  could  therefore  study  the  efficacy  of  the  DD 
pulse  sequence  for  each  circuit  location  individually.  But 
the  local-bath  model  admits  non-Markovian  effects,  tem¬ 
poral  noise  correlations  arising  from  the  bath’s  “mem¬ 
ory.”  The  theorem  in  |3lj,  with  the  threshold  condition 
expressed  in  terms  of  the  noise  strength  defined  as  in 
Eq.  (|J6l) ,  applies  to  non-Markovian  noise  (and  can  in  fact 
be  formulated  in  a  more  general  way,  so  that  the  local- 
bath  assumption  is  not  really  needed).  Therefore,  by 
using  the  Magnus  expansion  to  estimate  the  norm  ||£>a|| 
achieved  by  DD  at  each  location,  we  derived  a  threshold 
condition  applicable  to  a  noise  model  with  limited  spatial 
correlations  but  unlimited  temporal  correlations. 

The  situation  gets  more  complicated  if  our  characteri¬ 
zation  of  the  noise  depends  on  the  state  of  the  bath,  which 
evolves  during  the  computation.  The  theorem  in  [331  ex¬ 
presses  the  threshold  condition  in  terms  of  bath-state- 
dependent  correlation  functions,  under  the  assumption 
that  the  noise  is  Gaussian.  But  extending  that  theorem 
to  provide  an  expression  for  the  DD-protected  effective 
noise  strength  in  a  fully  non-Markovian  setting  is  a  tech¬ 
nically  daunting  problem  that  we  have  not  solved.  We 
will  settle  instead  for  a  rather  perverse  compromise,  in 
which  the  effects  of  the  bath’s  memory  are  included  in 
our  analysis  of  the  DD  pulse  sequence  at  each  circuit  lo¬ 
cation,  but  are  assumed  to  be  negligible  when  we  stitch 
the  DD-protected  gates  together  in  a  quantum  circuit. 

Under  the  assumption  that  the  bath’s  state  is  refreshed 
between  consecutive  circuit  locations,  the  noisy  operation 


at  location  a  is  applied  to  a  product  state,  where  the 
initial  state  pB,a  of  the  local  bath  for  location  a  does  not 
depend  on  the  noisy  operations  applied  at  earlier  circuit 
locations.  Under  this  assumption,  Eq.  (|J4l)  is  satisfied  if 
we  define 


rj  =  max  \\Ba  (I’F)  ®  |$Q)) 

a, It) 


(J?) 


where  |<f>a)  is  a  purification  of  pB,a,  and  the  maximum 
is  over  all  circuit  locations  and  over  all  pure  states  of 
the  system.  In  terms  of  the  interaction-picture  operator 
applied  at  location  a, 

Ua  =  QlGa  =  Gl(Qa  +  Ba)  =  la  +  glBa,  (J8) 


we  may  write  rj  as 

fj  =  max  (ua  -  IQ)  (I’F)  0  |<f>a)) 
or  equivalently 

o'1  = 

=  max(2Ia  -  Ua  -  uA  , 

a,|®>  \  / 


(J9) 


(J10) 


where  (•)  denotes  the  expectation  value  in  the  state  |T)  g) 
|<f>a).  This  is  the  formula  used  in  Sec.  Ixl 
For  a  location  with  duration  T,  the  interaction-picture 
time-evolution  operator  is  given  by  Dyson’s  formula 


U(T)  =  Tex  p 


(Jll) 


where  T  denotes  time-ordering  and  H(t )  is  the 
interaction-picture  Hamiltonian,  which  obeys  ||H(f)||  = 
|! Herr  ||  <  J-  Expanding  the  exponential,  we  find 

OO 

U(T)=I+J2Un(T),  (J12) 

71=1 

where 

Uu(T)  =  J  dh  ■■■  dtnT  (H(U)  •  •  ■  H(trSj  , 

(J13) 

and  hence 

\\Un(T)\\  <  VlH(f1)-..H(frt)||  <  (J14) 

n!  n\ 

Similarly,  W(T)  has  the  expansion 

OO 

U\T)=I+J2ukn  (J15) 

n=  1 

where 

ut(T)  =  ^-j  dh--- dtnr  (h(u) •  •  •  H(tnj) ; 

(J16) 
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here  T'  denotes  reverse-time  ordering,  and  again 

(, JT)n 


ii^tcnii  < 


n! 


(J17) 


Noting  that  Ui(T)  +  U\ (T)  =  0,  we  find 
(21 -U{T)-U\T)\ 

OO 

<  -  (u2(T)  +  U\{T))  +  £  || Un(T)  +  Ul(T)\\ 

n= 3 

<  -  (&2{T)  +  U\{T))  +  2^  (JT)n 


so  that 


U2(T)  +  U\(T) 


iy  dtidt2  (H(ti)H(t2)  +  H(t2)H(ti)^ 


n= 3 

<  -  (U2{T)  +  U%{T))  +  2  (eJT  -  1  -JT-  i(JT)2 

(J18) 

To  evaluate  the  expectation  value  of  U2(T)  +  U\{T),  we 
observe  that 


dtidt2H(ti)H(t2). 


Finally  we  may  express  the  noise  strength  as 


rj2  =  max  J  dtidt2  ( H(t\)H(t2 ) 


2  (  eJT  —  1  —  JT  —  -(JT)2 


T 


(H{tx)H(t2))  +  T'  (HitJHfaj) 


=  H(t1)H(t2)  +  H(t2)H(t1), 


(J19)  as  in  Eq.  (11701). 


(J20) 


(J21) 
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